PROFESSOR K. PEARSON ON THE INFLUENCE OF NATURAL 

(1.) Introductory. On the Influence of Selection upon Correlation. 

At an earlier stage in the development of the statistical theory of evolution it 
was suggested that the coefficient of correlation (Galton's function) might be found 
constant for all races of the same species— in fact, it was considered possible that this 
coefficient might be the long-sought-for criterion of identity in species. Professor 
Weldon, following up this suggestion of Mr. Galton's, then made the elaborate 
series of measurements on crabs with which his name will always be closely associated. 
To a first approximation these researches seemed to confirm the possibility of 
Galton's function being a true criterion of species. When, however, a finer 
mathematical test was applied to Professor Weldon's observations as well as to 
other statistical series for organs in man,* it became clear that the coefficient of 
correlation varied from local race to local race, and could not be used as a criterion 
of species. A slight investigation undertaken in the summer of 1896 convinced me 
that the coefficient of correlation between any two organs, is just as much peculiar and 
characteristic of a local race as the means and variations of those organs. In fact, 
if local races be the outcome of natural selection, then their coefficients of correlation 
must in general differ. The object of the present paper is to show, not only that 
natural selection must determine the amount of correlation, but that it is probably 
the chief factor in the production of correlation. If selection, natural or artificial, 
be capable of producing correlation, then it seems impossible to regard all correlation 
as evidence of a causal nexus,† although the converse proposition that all causal 
nexus denotes correlation, is undoubtedly the most philosophical method of regarding 
causality. 

In dealing with the influence of selection on correlation, I shall suppose the distri- 
bution of complex groups of organs to follow the normal correlation surface — the 
generalised Gaussian law of frequency. I shall further assume the selection surfaces 
to be normal in character. Neither of these assumptions is absolutely true, but the 
Gaussian law in a good many cases describes the frequency sufficiently closely to 
enable us to obtain fair numerical results by its application. Probably in all cases it 
will enable us to reach qualitative if not accurate quantitative theoretical deductions. 
I have the less hesitation in asserting this, as Mr. G. U. Yule has recently succeeded 
in deducing the chief formulae for correlation and regression as given by the 
Gaussian law from general principles, which make no appeal to a special law of 
frequency.‡ 

* "Mathematical Contributions to the Theory of Evolution. III.," 'Phil. Trans.,' A, vol. 189, pp. 266 
and 280. See also E. Warren, "Variation in Portunus depurator," 'Roy. Soc. Proc.,' vol. 60, 
pp. 233-4. 

† See a series of letters in 'Nature,' vol. 54, 1896, arising from a discussion upon a paper by A. R. 
Wallace. 

‡ 'Roy. Soc. Proc.,' vol. 60, p. 477. 

In particular, if a selected group be not given with very great accuracy by a 
normal frequency surface, still we may, I think, consider ourselves justified in 
supposing that the effects of the actual selection, and those of a normal selection 
with the same means, the same amounts of variation, and with correlations of the 
same intensity will be at least qualitatively alike in character, if they be not indeed 
exactly the same quantitatively within the limits fixed by the probable errors of the 
constants. 

My plan in this memoir will be as follows : — I shall first state the fundamental 
theorem in multiple correlation with a new proof, so that the formulae required may 
be once for all collected for reference.* I shall then give the algebraic investigation 
of the new formulae for selection. I shall afterwards consider special simple cases, 
and illustrate them by examples. Finally I shall draw attention to the nature of 
the selective death-rate as indicated in cases of this kind, and consider at length its 
algebraic theory. Throughout I shall endeavour to illustrate the somewhat complex 
algebra by arithmetical examples. 



(2.) On the Fundamental Theorem in Multiple Correlation. 

I have shown in my memoir on "Regression, Heredity, and Panmixia" ('Phil. 
Trans.,' A, vol. 187, p. 261) that if the $n$ variables of a complex be functions of $m$ 
($m > n$) independent variables with frequency distributions following the normal law, 
and such that the principle of superposition holds for the deviations from the means 
supposed small; then the frequency of the complex with deviations from the 
means of the $n$ variables lying between $x_1$ and $x_1 + \delta x_1$, $x_2$ and $x_2 + \delta x_2$, ... $x_n$ 
and $x_n + \delta x_n$ will be $z\,\delta x_1\,\delta x_2 \ldots \delta x_n$, where 

\[ z = z_0 \exp\left\{ -\tfrac{1}{2}\left( S_1(c_{pp} x_p^2) + 2 S_2(c_{pq} x_p x_q) \right) \right\} \tag{i.} \]

Here $z_0$, $c_{pp}$, $c_{pq}$ ... are constants, and $S_1$ denotes a summation for every value of $p$, 
and $S_2$ for every pair of values of $p$ and $q$ in the series from 1 to $n$. 

In the same memoir (p. 302) I have determined the values of $z_0$, $c_{pp}$, $c_{pq}$ in terms of 
the correlations $r_{pq}$ and the standard deviations $\sigma_p$ and $\sigma_q$ of the $n$ variables. This 
point had already been considered by Professor Edgeworth ('Phil. Mag.,' vol. 34, 
p. 201, 1892), and some further results by Mr. A. Black, reached before his death in 
1893, were published in the 'Camb. Phil. Trans.' (vol. 16, p. 219, 1897). The 
present investigation is, I think, novel, and adds to results already reached others 
required in the present memoir, so that it thus places together with a fairly simple 
proof all the fundamental results to which I shall have occasion to appeal later. 

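* We have used these formulae for several years, but they do not appear to have been hitherto published 
in a collected form. 

Let us consider the quadric of the $n$th order 

\[ Q = S_1(c_{pp} x_p^2) + 2 S_2(c_{pq} x_p x_q) = \text{constant}, \]

and fix our attention on two of the variables, say the first two, $x_1$ and $x_2$. If these 
be considered constants, the quadric of the remaining $n - 2$ variables will not now 
be referred to its "centre." But its centre, $\bar{x}_3, \bar{x}_4, \ldots \bar{x}_n$, will be given by the 
equations: 

\[
\begin{aligned}
c_{13} x_1 + c_{23} x_2 + c_{33} \bar{x}_3 + c_{43} \bar{x}_4 + \ldots + c_{n3} \bar{x}_n &= 0, \\
c_{14} x_1 + c_{24} x_2 + c_{34} \bar{x}_3 + c_{44} \bar{x}_4 + \ldots + c_{n4} \bar{x}_n &= 0, \\
\ldots \qquad & \\
c_{1n} x_1 + c_{2n} x_2 + c_{3n} \bar{x}_3 + c_{4n} \bar{x}_4 + \ldots + c_{nn} \bar{x}_n &= 0.
\end{aligned}
\tag{ii.}
\]

The following expressions will not be zero, but will be written $\alpha$ and $\beta$: 

\[
\begin{aligned}
c_{11} x_1 + c_{21} x_2 + c_{31} \bar{x}_3 + c_{41} \bar{x}_4 + \ldots + c_{n1} \bar{x}_n &= \alpha, \\
c_{12} x_1 + c_{22} x_2 + c_{32} \bar{x}_3 + c_{42} \bar{x}_4 + \ldots + c_{n2} \bar{x}_n &= \beta.
\end{aligned}
\tag{iii.}
\]

Now, if $\Delta$ be the discriminant, 

\[
\Delta = \begin{vmatrix}
c_{11} & c_{12} & c_{13} & \ldots & c_{1n} \\
c_{21} & c_{22} & c_{23} & \ldots & c_{2n} \\
c_{31} & c_{32} & c_{33} & \ldots & c_{3n} \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
c_{n1} & c_{n2} & c_{n3} & \ldots & c_{nn}
\end{vmatrix}
\tag{iv.}
\]

and $C_{pq}$ the minor corresponding to $c_{pq}$, we have by solving the above $n$ linear 
equations, 

\[ x_1 = \frac{\alpha C_{11} + \beta C_{12}}{\Delta}, \qquad x_2 = \frac{\alpha C_{12} + \beta C_{22}}{\Delta}, \]

whence: 

\[ \alpha = \frac{\Delta\,(x_1 C_{22} - x_2 C_{12})}{C_{11} C_{22} - C_{12}^2}, \qquad \beta = \frac{\Delta\,(x_2 C_{11} - x_1 C_{12})}{C_{11} C_{22} - C_{12}^2}. \tag{v.} \]

Generally also: 

\[ \bar{x}_q = \frac{\alpha C_{1q} + \beta C_{2q}}{\Delta} = \frac{(C_{1q} C_{22} - C_{2q} C_{12})\, x_1 + (C_{11} C_{2q} - C_{1q} C_{12})\, x_2}{C_{11} C_{22} - C_{12}^2}. \tag{vi.} \]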
This determines the central co-ordinate for any variable $x_q$ for a given value of 
$x_1$ and $x_2$. 

Now let us transfer the quadric to $\bar{x}_3, \bar{x}_4 \ldots \bar{x}_n$ as origin, writing $x_q = \bar{x}_q + x'_q$. It may be written 

\[
\begin{aligned}
Q = {} & S_{q=3}^{q=n} \left\{ (\bar{x}_q + x'_q)(c_{1q} x_1 + c_{2q} x_2 + c_{3q} x_3 + \ldots + c_{nq} x_n) \right\} \\
& + x_1 (c_{11} x_1 + c_{21} x_2 + c_{31} x_3 + \ldots + c_{n1} x_n) \\
& + x_2 (c_{12} x_1 + c_{22} x_2 + c_{32} x_3 + \ldots + c_{n2} x_n).
\end{aligned}
\]

Making use of the $n$ linear equations (ii.) and (iii.): 

\[
\begin{aligned}
Q = {} & S_{q=3}^{q=n} \left\{ (\bar{x}_q + x'_q)(c_{3q} x'_3 + c_{4q} x'_4 + \ldots + c_{nq} x'_n) \right\} \\
& + x_1 (\alpha + c_{31} x'_3 + c_{41} x'_4 + \ldots + c_{n1} x'_n) \\
& + x_2 (\beta + c_{32} x'_3 + c_{42} x'_4 + \ldots + c_{n2} x'_n) \\
= {} & S_{q=3}^{q=n} \left\{ x'_q (c_{3q} x'_3 + c_{4q} x'_4 + \ldots + c_{nq} x'_n) \right\} + \alpha x_1 + \beta x_2.
\end{aligned}
\tag{vii.}
\]

For arranging vertical columns in rows, the remaining terms are 

\[
\begin{aligned}
& x'_3 \left( c_{31} x_1 + c_{32} x_2 + S_{q=3}^{q=n}(c_{3q} \bar{x}_q) \right) \\
& + x'_4 \left( c_{41} x_1 + c_{42} x_2 + S_{q=3}^{q=n}(c_{4q} \bar{x}_q) \right) + \ldots,
\end{aligned}
\]

each line of which vanishes by the equations (ii.) for the centre. 

Accordingly: 

\[ Q = Q' + \alpha x_1 + \beta x_2, \]

where $Q'$ is a quadratic function of $x'_3, x'_4, \ldots x'_n$ not involving $x_1$ and $x_2$ at all. 

Hence: 

\[ z = z_0\, e^{-\frac{1}{2}(Q' + \alpha x_1 + \beta x_2)}. \]

Now integrate $z$ with respect to all the variables $x'_3, x'_4, \ldots x'_n$ from $-\infty$ to $+\infty$, 
keeping $x_1$ and $x_2$ constant. 

Then, although the origin is a function of $x_1$ and $x_2$, 

\[ \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \ldots \int_{-\infty}^{+\infty} z_0\, e^{-\frac{1}{2} Q'}\, dx'_3\, dx'_4 \ldots dx'_n \]

cannot involve $x_1$ and $x_2$ but only $c_{33}, c_{3q}, \ldots c_{nq}$, &c.; let the result be $\zeta_0$. Then: 

\[ z = \zeta_0\, e^{-\frac{1}{2}(\alpha x_1 + \beta x_2)} \]

must be the correlation surface for $x_1$ and $x_2$ for all values of $x_3, x_4 \ldots x_n$. We may 
write it in the form 

\[ z = \zeta_0 \exp\left\{ -\tfrac{1}{2}\, \Delta\, \frac{C_{22} x_1^2 - 2 C_{12} x_1 x_2 + C_{11} x_2^2}{C_{11} C_{22} - C_{12}^2} \right\}. \tag{viii.} \]

Comparing this with the known form of the correlation surface for two variables,* 

\[ z = \frac{N}{2\pi \sigma_1 \sigma_2 \sqrt{1 - r_{12}^2}} \exp\left\{ -\tfrac{1}{2}\, \frac{1}{1 - r_{12}^2} \left( \frac{x_1^2}{\sigma_1^2} - \frac{2 r_{12} x_1 x_2}{\sigma_1 \sigma_2} + \frac{x_2^2}{\sigma_2^2} \right) \right\}, \tag{ix.} \]

we have at once 

\[ \frac{1}{\sigma_1^2 (1 - r_{12}^2)} = \frac{\Delta\, C_{22}}{C_{11} C_{22} - C_{12}^2}, \qquad \frac{1}{\sigma_2^2 (1 - r_{12}^2)} = \frac{\Delta\, C_{11}}{C_{11} C_{22} - C_{12}^2}, \qquad \frac{r_{12}}{\sigma_1 \sigma_2 (1 - r_{12}^2)} = \frac{\Delta\, C_{12}}{C_{11} C_{22} - C_{12}^2}. \tag{x.} \]

Whence 

\[ r_{12} = \frac{C_{12}}{\sqrt{C_{11} C_{22}}}, \tag{xi.} \]

\[ \sigma_1^2 = C_{11}/\Delta, \qquad \sigma_2^2 = C_{22}/\Delta. \tag{xii.} \]

Or, generally: 

\[ r_{pq} = \frac{C_{pq}}{\sqrt{C_{pp} C_{qq}}}, \tag{xiii.} \]

\[ \sigma_p^2 = C_{pp}/\Delta. \tag{xiv.} \]

Thus correlations and variations are fully determined in terms of the discriminant 
and its minors for the constants $c_{11}, c_{22}, c_{12}, \ldots c_{pp}, c_{qq}, c_{pq}, \ldots$ 
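
The relations between the constants of the exponent and the standard deviations and correlations can be checked numerically. What follows is a minimal sketch and not part of the memoir: the array `c` is an arbitrary illustrative set of constants (an assumption), and `det` and `cofactor` are hypothetical helper names. It forms each standard deviation from a diagonal signed minor divided by the discriminant, each correlation from the corresponding off-diagonal signed minor, and then confirms that the products $r_{pq}\sigma_p\sigma_q$ form the reciprocal system of the c's.

```python
import math


def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def cofactor(m, p, q):
    """Signed minor of the constituent in row p, column q (0-based indices)."""
    sub = [row[:q] + row[q + 1:] for i, row in enumerate(m) if i != p]
    return (-1) ** (p + q) * det(sub)


# Illustrative constants c_pq (assumed values; symmetric and positive definite).
c = [[2.0, -0.5, 0.3],
     [-0.5, 1.5, -0.2],
     [0.3, -0.2, 1.0]]
n = len(c)
delta = det(c)

# sigma_p^2 = C_pp / Delta and r_pq = C_pq / sqrt(C_pp C_qq).
sigma = [math.sqrt(cofactor(c, p, p) / delta) for p in range(n)]
r = [[cofactor(c, p, q) / math.sqrt(cofactor(c, p, p) * cofactor(c, q, q))
      for q in range(n)] for p in range(n)]

# The products r_pq sigma_p sigma_q should be the inverse of the array c.
cov = [[r[p][q] * sigma[p] * sigma[q] for q in range(n)] for p in range(n)]
product = [[sum(c[i][k] * cov[k][j] for k in range(n)) for j in range(n)]
           for i in range(n)]
```

The array `product` should be the unit array to within rounding, which is the same statement as the determinant relations used next to recover the c's from the r's and σ's.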

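We have next the inverse proposition to find the $c$'s in terms of the $r$'s and $\sigma$'s. 
We have, by well-known propositions in the theory of determinants: 

\[
\begin{aligned}
c_{11} C_{11} + c_{12} C_{12} + c_{13} C_{13} + \ldots + c_{1n} C_{1n} &= \Delta, \\
c_{11} C_{21} + c_{12} C_{22} + c_{13} C_{23} + \ldots + c_{1n} C_{2n} &= 0, \\
c_{11} C_{31} + c_{12} C_{32} + c_{13} C_{33} + \ldots + c_{1n} C_{3n} &= 0, \\
\ldots \qquad & \\
c_{11} C_{n1} + c_{12} C_{n2} + c_{13} C_{n3} + \ldots + c_{1n} C_{nn} &= 0.
\end{aligned}
\]

Or, 

\[
\begin{aligned}
c_{11} \sigma_1^2 + c_{12} r_{12} \sigma_1 \sigma_2 + c_{13} r_{13} \sigma_1 \sigma_3 + \ldots + c_{1n} r_{1n} \sigma_1 \sigma_n &= 1, \\
c_{11} r_{21} \sigma_1 \sigma_2 + c_{12} \sigma_2^2 + c_{13} r_{23} \sigma_2 \sigma_3 + \ldots + c_{1n} r_{2n} \sigma_2 \sigma_n &= 0, \\
c_{11} r_{31} \sigma_1 \sigma_3 + c_{12} r_{32} \sigma_2 \sigma_3 + c_{13} \sigma_3^2 + \ldots + c_{1n} r_{3n} \sigma_3 \sigma_n &= 0, \\
\ldots \qquad & \\
c_{11} r_{n1} \sigma_1 \sigma_n + c_{12} r_{n2} \sigma_2 \sigma_n + c_{13} r_{n3} \sigma_3 \sigma_n + \ldots + c_{1n} \sigma_n^2 &= 0.
\end{aligned}
\tag{xv.}
\]

Hence solving 

\[ c_{11} = S_{11}/S, \qquad c_{12} = S_{12}/S, \quad \ldots \quad c_{1q} = S_{1q}/S, \]

* Pearson, 'Phil. Trans.,' A, vol. 187, p. 264. 

where 

\[
S = \begin{vmatrix}
\sigma_1^2 & r_{12} \sigma_1 \sigma_2 & r_{13} \sigma_1 \sigma_3 & \ldots & r_{1n} \sigma_1 \sigma_n \\
r_{21} \sigma_1 \sigma_2 & \sigma_2^2 & r_{23} \sigma_2 \sigma_3 & \ldots & r_{2n} \sigma_2 \sigma_n \\
r_{31} \sigma_1 \sigma_3 & r_{32} \sigma_2 \sigma_3 & \sigma_3^2 & \ldots & r_{3n} \sigma_3 \sigma_n \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
r_{n1} \sigma_1 \sigma_n & r_{n2} \sigma_2 \sigma_n & r_{n3} \sigma_3 \sigma_n & \ldots & \sigma_n^2
\end{vmatrix}
\]

and $S_{pq}$ is the minor corresponding to the constituent $r_{pq} \sigma_p \sigma_q$. 

But 

\[ S = \sigma_1^2 \sigma_2^2 \sigma_3^2 \ldots \sigma_n^2\, R, \qquad S_{pq} = \frac{\sigma_1^2 \sigma_2^2 \ldots \sigma_n^2}{\sigma_p \sigma_q}\, R_{pq}, \]

where $R$ is the determinant, 

\[
R = \begin{vmatrix}
1 & r_{12} & r_{13} & \ldots & r_{1n} \\
r_{21} & 1 & r_{23} & \ldots & r_{2n} \\
r_{31} & r_{32} & 1 & \ldots & r_{3n} \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
r_{n1} & r_{n2} & r_{n3} & \ldots & 1
\end{vmatrix}
\tag{xvi.}
\]

and $R_{pq}$ is the minor corresponding to the constituent $r_{pq}$. Thus we have 

\[ c_{11} = \frac{1}{\sigma_1^2} \frac{R_{11}}{R}, \qquad c_{12} = \frac{1}{\sigma_1 \sigma_2} \frac{R_{12}}{R}, \tag{xvii.} \]

or, generally, 

\[ c_{pp} = \frac{1}{\sigma_p^2} \frac{R_{pp}}{R}, \qquad c_{pq} = \frac{1}{\sigma_p \sigma_q} \frac{R_{pq}}{R}. \tag{xviii.} \]

Thus $z$ may be written* 

\[ z = z_0 \exp\left\{ -\tfrac{1}{2} \left[ S_1\!\left( \frac{R_{pp}}{R} \frac{x_p^2}{\sigma_p^2} \right) + 2 S_2\!\left( \frac{R_{pq}}{R} \frac{x_p x_q}{\sigma_p \sigma_q} \right) \right] \right\}. \tag{xix.} \]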
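
As a worked check (an added illustration, not part of the original text), the general expressions for the $c$'s in terms of the minors of $R$ may be written out for $n = 2$, where they should reproduce the exponent of the known two-variable surface (ix.).

```latex
% Worked case n = 2 of c_pp = R_pp / (sigma_p^2 R), c_pq = R_pq / (sigma_p sigma_q R).
\[
R = \begin{vmatrix} 1 & r_{12} \\ r_{12} & 1 \end{vmatrix} = 1 - r_{12}^2,
\qquad R_{11} = R_{22} = 1, \qquad R_{12} = -r_{12},
\]
\[
c_{11} = \frac{1}{\sigma_1^2 (1 - r_{12}^2)}, \qquad
c_{22} = \frac{1}{\sigma_2^2 (1 - r_{12}^2)}, \qquad
c_{12} = \frac{-r_{12}}{\sigma_1 \sigma_2 (1 - r_{12}^2)},
\]
\[
c_{11} x_1^2 + 2 c_{12} x_1 x_2 + c_{22} x_2^2
 = \frac{1}{1 - r_{12}^2}
   \left( \frac{x_1^2}{\sigma_1^2} - \frac{2 r_{12} x_1 x_2}{\sigma_1 \sigma_2} + \frac{x_2^2}{\sigma_2^2} \right).
\]
```

The minor $R_{12}$ is here taken with its sign, which is the convention the determinant relations above require.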






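It remains to determine $z_0$ from the fact that the volume of the surface $= N$. 

Or, 

\[
\begin{aligned}
N &= z_0 \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \ldots \int_{-\infty}^{+\infty} \exp\left\{ -\tfrac{1}{2} \left[ S_1\!\left( \frac{R_{pp}}{R} \frac{x_p^2}{\sigma_p^2} \right) + 2 S_2\!\left( \frac{R_{pq}}{R} \frac{x_p x_q}{\sigma_p \sigma_q} \right) \right] \right\} dx_1\, dx_2 \ldots dx_n \\
&= z_0\, \sigma_1 \sigma_2 \ldots \sigma_n \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \ldots \int_{-\infty}^{+\infty} \exp\left\{ -\tfrac{1}{2} \left[ S_1\!\left( \frac{R_{pp}}{R} x_p'^2 \right) + 2 S_2\!\left( \frac{R_{pq}}{R} x'_p x'_q \right) \right] \right\} dx'_1\, dx'_2 \ldots dx'_n,
\end{aligned}
\tag{xx.}
\]

by writing $x_p/\sigma_p = x'_p$, &c. 

* 'Phil. Trans.,' A, vol. 187, p. 302. 

Now, integrate first with regard to $x'_1$, writing first 

\[
\begin{aligned}
S_1\!\left( \frac{R_{pp}}{R} x_p'^2 \right) + 2 S_2\!\left( \frac{R_{pq}}{R} x'_p x'_q \right)
= {} & \frac{R_{11}}{R} \left( x'_1 + \Sigma_1\!\left( \frac{R_{1p}}{R_{11}} x'_p \right) \right)^2 \\
& + \Sigma_1\!\left( \frac{R_{pp} R_{11} - R_{1p}^2}{R\, R_{11}} x_p'^2 \right) + 2 \Sigma_2\!\left( \frac{R_{pq} R_{11} - R_{1p} R_{1q}}{R\, R_{11}} x'_p x'_q \right),
\end{aligned}
\]

where $\Sigma_1$ is the sum for all values of $p$, and $\Sigma_2$ for all pairs of values of $p$ and $q$, from 
2 to $n$ inclusive, 

\[
= \frac{R_{11}}{R} \left( x'_1 + \Sigma_1\!\left( \frac{R_{1p}}{R_{11}} x'_p \right) \right)^2 + \Sigma_1\!\left( \frac{R'_{pp}}{R'} x_p'^2 \right) + 2 \Sigma_2\!\left( \frac{R'_{pq}}{R'} x'_p x'_q \right), \quad \text{where } R' = R_{11}.
\]

For if $R' = R_{11}$, or the determinant of the correlation coefficients, omitting all 
involving the first variable, i.e., the first row and column of $R$, the determinant $R'_{pq}$ 
corresponding to the minor of the constituent $r_{pq}$ in $R'$, or to the second minor $(R_{11})_{pq}$ 
of $R$, is given by 

\[ R\, R'_{pq} = R_{pq} R_{11} - R_{1p} R_{1q}, \tag{xxi.} \]

and for $p = q$, 

\[ R\, R'_{pp} = R_{pp} R_{11} - R_{1p}^2. \tag{xxii.} \]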
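
The determinant identities just used can be verified on any small table of correlations. The following is a minimal sketch under stated assumptions: the 4 × 4 array `r` holds arbitrary illustrative values, `det` and `cofactor` are hypothetical helper names, and all minors are taken with their signs. It compares the product of $R$ and each second minor with the combination of first minors on the other side of the identity.

```python
def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def cofactor(m, p, q):
    """Signed minor of the constituent in row p, column q (0-based indices)."""
    sub = [row[:q] + row[q + 1:] for i, row in enumerate(m) if i != p]
    return (-1) ** (p + q) * det(sub)


# Illustrative correlation table (assumed values).
r = [[1.0, 0.5, 0.3, 0.2],
     [0.5, 1.0, 0.4, 0.1],
     [0.3, 0.4, 1.0, 0.6],
     [0.2, 0.1, 0.6, 1.0]]
n = len(r)
R = det(r)

# R' is R with its first row and column struck out.
r_prime = [row[1:] for row in r[1:]]

# Largest discrepancy in R * R'_pq = R_pq R_11 - R_1p R_1q over p, q = 2 ... n.
max_gap = 0.0
for p in range(1, n):
    for q in range(1, n):
        lhs = R * cofactor(r_prime, p - 1, q - 1)
        rhs = (cofactor(r, p, q) * cofactor(r, 0, 0)
               - cofactor(r, 0, p) * cofactor(r, 0, q))
        max_gap = max(max_gap, abs(lhs - rhs))
```

A value of `max_gap` at the level of rounding error indicates that the identity holds for every pair, including the diagonal case $p = q$.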

Hence, integrating $x'_1$ between the limits $+\infty$ and $-\infty$, we have 

\[
N = z_0\, \sigma_1 \sigma_2 \ldots \sigma_n\, \sqrt{2\pi}\, \sqrt{R/R'} \int_{-\infty}^{+\infty} \int_{-\infty}^{+\infty} \ldots \int_{-\infty}^{+\infty} \exp\left\{ -\tfrac{1}{2} \left[ \Sigma_1\!\left( \frac{R'_{pp}}{R'} x_p'^2 \right) + 2 \Sigma_2\!\left( \frac{R'_{pq}}{R'} x'_p x'_q \right) \right] \right\} dx'_2\, dx'_3 \ldots dx'_n.
\]

This is of precisely the same form as before, except that we have the factor 
$\sqrt{2\pi}\, \sqrt{R/R'}$, and the multiple integral is reduced by one integration and by the 
disappearance of all correlations involving the first variable. Now, integrate with 
regard to $x'_2$. The sole effect will be to multiply by a factor $\sqrt{2\pi}\, \sqrt{R'/R''}$, where 
$R''$ is the minor of $R'$ not involving correlations of the second variable. Thus, by 
repeating the process, we have ultimately 

\[ N = z_0\, \sigma_1 \sigma_2 \ldots \sigma_n\, (\sqrt{2\pi})^n\, \sqrt{R/R'}\, \sqrt{R'/R''}\, \sqrt{R''/R'''} \ldots \sqrt{R^{(n-1)}/1}, \]

or 

\[ z_0 = \frac{N}{(2\pi)^{n/2}\, \sigma_1 \sigma_2 \ldots \sigma_n\, \sqrt{R}}, \tag{xxiii.} \]

which gives the constant of the surface. 
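
The constant of the surface is easily evaluated for small $n$. The sketch below is an added illustration, not the author's procedure: `surface_constant` and `det` are hypothetical names and the numerical inputs are arbitrary. It computes $N$ divided by $(2\pi)^{n/2}$, the product of the standard deviations and $\sqrt{R}$, and may be compared with the familiar one- and two-variable constants.

```python
import math


def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def surface_constant(total, sigmas, r):
    """z_0 = N / ((2 pi)^(n/2) * sigma_1 ... sigma_n * sqrt(R))."""
    n = len(sigmas)
    return total / ((2.0 * math.pi) ** (n / 2.0)
                    * math.prod(sigmas) * math.sqrt(det(r)))


# One variable: the ordinary normal curve constant N / (sqrt(2 pi) sigma).
z0_one = surface_constant(100.0, [2.0], [[1.0]])

# Two variables: N / (2 pi sigma_1 sigma_2 sqrt(1 - r^2)), as in the known surface.
z0_two = surface_constant(100.0, [2.0, 3.0], [[1.0, 0.6], [0.6, 1.0]])
```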

The preceding investigation enables us also to deal with two further points. 

(a) Given $n$ variables, what is the mean value $m_1 + \bar{x}_1$ of the first variable and 
its variability $\sigma'_1$ for definite values $m_2 + h_2$, $m_3 + h_3$, ... $m_n + h_n$ of the other 
$(n - 1)$ variables? 

Clearly, when $x'_2, x'_3 \ldots x'_n$ are constants, the distribution of $x'_1$ is of the form 

\[ \text{constant} \times \exp\left\{ -\tfrac{1}{2}\, \frac{R_{11}}{R} \left( x'_1 + \Sigma_1\!\left( \frac{R_{1p}}{R_{11}} x'_p \right) \right)^2 \right\}. \]

Or, re-introducing the $\sigma_1, \sigma_2, \ldots \sigma_n$, we have a distribution about the point given by 

\[ \bar{x}_1 = -\sigma_1 \left( \frac{R_{12}}{R_{11}} \frac{h_2}{\sigma_2} + \frac{R_{13}}{R_{11}} \frac{h_3}{\sigma_3} + \ldots + \frac{R_{1n}}{R_{11}} \frac{h_n}{\sigma_n} \right), \]

with standard deviation 

\[ \sigma'_1 = \sigma_1 \sqrt{R/R_{11}}. \tag{xxiv.} \]
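
The regression result can be tried on three variables, where the coefficients are known in closed form. This is a minimal sketch with assumed inputs: `array_mean_and_sd`, `det` and `cofactor` are hypothetical names, the minors are taken with their signs, and the numbers are arbitrary. It returns the mean deviation and the standard deviation of the first variable for given deviations of the others.

```python
import math


def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def cofactor(m, p, q):
    """Signed minor of the constituent in row p, column q (0-based indices)."""
    sub = [row[:q] + row[q + 1:] for i, row in enumerate(m) if i != p]
    return (-1) ** (p + q) * det(sub)


def array_mean_and_sd(sigmas, r, h):
    """Mean deviation and s.d. of variable 1 when variables 2..n deviate by h."""
    big_r, r11 = det(r), cofactor(r, 0, 0)
    mean = -sigmas[0] * sum(cofactor(r, 0, p) / r11 * h[p - 1] / sigmas[p]
                            for p in range(1, len(sigmas)))
    return mean, sigmas[0] * math.sqrt(big_r / r11)


# Illustrative values (assumed): three organs, the last two fixed at h_2, h_3.
sigmas = [2.0, 3.0, 4.0]
r12, r13, r23 = 0.5, 0.3, 0.4
r = [[1.0, r12, r13], [r12, 1.0, r23], [r13, r23, 1.0]]
h = [1.5, -2.0]
mean, sd = array_mean_and_sd(sigmas, r, h)

# Closed forms for three variables, for comparison.
k = 1.0 - r23 ** 2
mean_closed = sigmas[0] * ((r12 - r13 * r23) / k * h[0] / sigmas[1]
                           + (r13 - r12 * r23) / k * h[1] / sigmas[2])
sd_closed = sigmas[0] * math.sqrt(
    (1.0 - r12 ** 2 - r13 ** 2 - r23 ** 2 + 2.0 * r12 * r13 * r23) / k)
```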



(b) Given $n$ variables, what are the mean values $m_1 + \bar{x}_1$, $m_2 + \bar{x}_2$, the standard 
deviations $\sigma'_1$, $\sigma'_2$ and the correlation $r'_{12}$ of two of them, when we give definite 
values $m_3 + h_3$, $m_4 + h_4$, ... $m_n + h_n$ to the remaining $(n - 2)$ variables? 
In this case we have from (i.) 

\[
\begin{aligned}
z = z_0 \exp -\tfrac{1}{2} \{ & c_{11} x_1^2 + 2 c_{12} x_1 x_2 + c_{22} x_2^2 \\
& + 2 (c_{13} h_3 + c_{14} h_4 + \ldots + c_{1n} h_n)\, x_1 \\
& + 2 (c_{23} h_3 + c_{24} h_4 + \ldots + c_{2n} h_n)\, x_2 \\
& + \text{terms not involving } x_1 \text{ and } x_2 \}.
\end{aligned}
\tag{xxv.}
\]

Writing $K_1 = c_{13} h_3 + \ldots + c_{1n} h_n$ for the bracketed coefficient of $x_1$, and $K_2 = c_{23} h_3 + \ldots + c_{2n} h_n$ for that of $x_2$, we have for the centre 

\[ \bar{x}_1 = -\frac{K_1 c_{22} - K_2 c_{12}}{c_{11} c_{22} - c_{12}^2}, \qquad \bar{x}_2 = -\frac{-K_1 c_{12} + c_{11} K_2}{c_{11} c_{22} - c_{12}^2}, \]

so that 

\[
\bar{x}_1 = -S_{p=3}^{p=n} \left( \frac{c_{1p} c_{22} - c_{2p} c_{12}}{c_{11} c_{22} - c_{12}^2}\, h_p \right)
= -\sigma_1\, S_{p=3}^{p=n} \left( \frac{R_{1p} R_{22} - R_{2p} R_{12}}{R_{11} R_{22} - R_{12}^2}\, \frac{h_p}{\sigma_p} \right),
\]

\[
\bar{x}_2 = -S_{p=3}^{p=n} \left( \frac{c_{2p} c_{11} - c_{1p} c_{12}}{c_{11} c_{22} - c_{12}^2}\, h_p \right)
= -\sigma_2\, S_{p=3}^{p=n} \left( \frac{R_{2p} R_{11} - R_{1p} R_{12}}{R_{11} R_{22} - R_{12}^2}\, \frac{h_p}{\sigma_p} \right),
\]

by transferring to the minors of $R$ and the $\sigma$'s. Or, 

\[
\bar{x}_1 = -\sigma_1\, S_{p=3}^{p=n} \left( \frac{\rho'_{1p}}{R''} \frac{h_p}{\sigma_p} \right), \qquad
\bar{x}_2 = -\sigma_2\, S_{p=3}^{p=n} \left( \frac{\rho'_{2p}}{R''} \frac{h_p}{\sigma_p} \right).
\tag{xxvi.}
\]

Here $R''$ is the determinant formed by striking out the first two rows and 
columns of $R$; $\rho'_{1p}$ is the minor obtained by striking out the second row and column 
from $R$, and then the first row and $p$th column; $\rho'_{2p}$ the minor obtained by striking 
out the first row and column, and then the second row and $p$th column. But a 
comparison with (xxiv.) shows us that these values for $\bar{x}_1$ and $\bar{x}_2$ are precisely what 
we should have obtained for the regression equations of the 1st and 2nd variables 
respectively alone on the other $n - 2$ variables. Thus the existence and the 
correlations of $x_2$ have no effect on the value of $\bar{x}_1$, nor those of $x_1$ on the value of $\bar{x}_2$. 

Returning to (xxv.), we remark that the terms of the second order in $x_1$ and $x_2$, on 
which the correlation and variations depend, are not altered by a transfer to the 
centre $\bar{x}_1$ and $\bar{x}_2$ of the array. 

Hence by (ix.) and (xvii.) we have 

\[ r'_{12} = -\frac{c_{12}}{\sqrt{c_{11} c_{22}}} = -\frac{R_{12}}{\sqrt{R_{11} R_{22}}}. \tag{xxvii.} \]

This is the partial correlation of the 1st and 2nd organs for the remaining $n - 2$ 
organs with constant values. 
Again, 

\[ c_{11} = \frac{1}{\sigma_1'^2 (1 - r_{12}'^2)}, \qquad c_{22} = \frac{1}{\sigma_2'^2 (1 - r_{12}'^2)}. \]

Whence we easily find from (xviii.) and (xxvii.) 

\[ \sigma_1'^2 = \sigma_1^2\, \frac{R\, R_{22}}{R_{11} R_{22} - R_{12}^2}, \qquad \sigma_2'^2 = \sigma_2^2\, \frac{R\, R_{11}}{R_{11} R_{22} - R_{12}^2}, \]

or, 

\[ \sigma'_1 = \sigma_1 \sqrt{R_{22}/R''}, \qquad \sigma'_2 = \sigma_2 \sqrt{R_{11}/R''}, \tag{xxviii.} \]

where $R''$ is, as before, the determinant $R$ without its first two rows and columns. 
These, by (xxiv.), are what we should have reached by ignoring $x_2$ in finding $\sigma'_1$, and 
$x_1$ in finding $\sigma'_2$. 
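
For three organs the partial correlation and the array standard deviations reduce to well-known expressions, which gives a convenient check. The sketch below is an added illustration under stated assumptions: `partial_correlation`, `array_sds`, `det` and `cofactor` are hypothetical names, the minors carry their signs, and the correlations are arbitrary values.

```python
import math


def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def cofactor(m, p, q):
    """Signed minor of the constituent in row p, column q (0-based indices)."""
    sub = [row[:q] + row[q + 1:] for i, row in enumerate(m) if i != p]
    return (-1) ** (p + q) * det(sub)


def partial_correlation(r):
    """r'_12 = -R_12 / sqrt(R_11 R_22), the other organs being held constant."""
    return -cofactor(r, 0, 1) / math.sqrt(cofactor(r, 0, 0) * cofactor(r, 1, 1))


def array_sds(sigma1, sigma2, r):
    """sigma'_1 = sigma_1 sqrt(R_22 / R''), sigma'_2 = sigma_2 sqrt(R_11 / R'')."""
    r_two = det([row[2:] for row in r[2:]])  # R with two rows and columns struck out
    return (sigma1 * math.sqrt(cofactor(r, 1, 1) / r_two),
            sigma2 * math.sqrt(cofactor(r, 0, 0) / r_two))


# Illustrative correlations (assumed values) for three organs.
r12, r13, r23 = 0.5, 0.3, 0.4
r = [[1.0, r12, r13], [r12, 1.0, r23], [r13, r23, 1.0]]
rp = partial_correlation(r)
sd1, sd2 = array_sds(2.0, 3.0, r)
```

With one organ held constant, `rp` should equal $(r_{12} - r_{13} r_{23})/\sqrt{(1 - r_{13}^2)(1 - r_{23}^2)}$, and `sd1` should equal $\sigma_1 \sqrt{1 - r_{13}^2}$, the value obtained when the second organ is ignored.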

(3.) General Theorem in Selection. 

To find the selected means, the selected variations and selected correlations, when 
$q$ organs are selected, naturally or artificially, out of a complex of $n$ organs. 

Let the selected group of $q$ organs have their means raised $h_1, h_2, h_3, \ldots h_q$ (some 
of these quantities may be negative); their standard deviations changed from 
$\sigma_1, \sigma_2, \ldots \sigma_q$ to $s_1, s_2, s_3, \ldots s_q$, and their mutual correlations from $r_{12}, r_{13}, \ldots r_{1q}$, 
$r_{23}, r_{24}, \ldots r_{2q}, \ldots r_{q-1,q}$ to $\rho_{12}, \rho_{13}, \ldots \rho_{1q}, \rho_{23}, \rho_{24}, \ldots \rho_{2q}, \ldots \rho_{q-1,q}$. 

The whole system of $n$ organs before selection will be defined by the means as 
origin of measurement for each organ, by the standard deviations $\sigma_1, \sigma_2, \sigma_3, \ldots \sigma_n$, 
and by the coefficients of correlation $r_{12}, r_{13}, \ldots r_{1n}, r_{23}, r_{24}, \ldots r_{2n}, \ldots r_{n-1,n}$. Let 
$R$ be the determinant 

\[
R = \begin{vmatrix}
1 & r_{12} & r_{13} & \ldots & r_{1n} \\
r_{21} & 1 & r_{23} & \ldots & r_{2n} \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
r_{n1} & r_{n2} & r_{n3} & \ldots & 1
\end{vmatrix}
\]

and $R_{pq}$ the minor corresponding to the constituent $r_{pq}$. Then the unselected 
population is given by the frequency surface of equation (xix.), 

\[ z = \text{constant} \times \exp\left\{ -\tfrac{1}{2} \left[ S_1\!\left( \frac{R_{pp}}{R} \frac{x_p^2}{\sigma_p^2} \right) + 2 S_2\!\left( \frac{R_{pq}}{R} \frac{x_p x_q}{\sigma_p \sigma_q} \right) \right] \right\}. \tag{xxix.} \]

For brevity, we can also write this in the form 

\[ z = \text{constant} \times \exp -\tfrac{1}{2} \left\{ S_1(c_{pp} x_p^2) + 2 S_2(c_{pq} x_p x_q) \right\}. \tag{xxix. bis} \]

Now consider for the time only $q + 1$ organs, namely, the first $q$ organs and the 
$u$th organ ($u > q$), and let us write $R(u)$, if $u$ be $> q$, for the determinant: 

\[
R(u) = \begin{vmatrix}
1 & r_{12} & r_{13} & \ldots & r_{1q} & r_{1u} \\
r_{21} & 1 & r_{23} & \ldots & r_{2q} & r_{2u} \\
r_{31} & r_{32} & 1 & \ldots & r_{3q} & r_{3u} \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
r_{q1} & r_{q2} & r_{q3} & \ldots & 1 & r_{qu} \\
r_{u1} & r_{u2} & r_{u3} & \ldots & r_{uq} & 1
\end{vmatrix}
\tag{xxx.}
\]

Then if $R(u)_{p'p''}$ be the minor corresponding to the constituent $r_{p'p''}$, and if 
$b_{p'p''} = \dfrac{R(u)_{p'p''}}{R(u)\, \sigma_{p'} \sigma_{p''}}$, the distribution of the $q + 1$ organs will be given by the 
frequency distribution 

\[ z' = \text{constant} \times \exp -\tfrac{1}{2} \left\{ S_1(b_{p'p'} x_{p'}^2) + 2 S_2(b_{p'p''} x_{p'} x_{p''}) \right\}, \tag{xxxi.} \]

$S_1$ being a sum for every value of $p'$ throughout the $q + 1$ organs, and $S_2$ for every 
pair of values. 
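Now let the first $q$ organs be given values $h_1, h_2, \ldots h_q$, then the mean value of $x_u$ 
will be given by 

\[
\begin{aligned}
\bar{x}_u &= -\left( \frac{b_{1u}}{b_{uu}} h_1 + \frac{b_{2u}}{b_{uu}} h_2 + \ldots + \frac{b_{qu}}{b_{uu}} h_q \right) \\
&= -\left( \frac{R(u)_{1u}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_1} h_1 + \frac{R(u)_{2u}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_2} h_2 + \ldots + \frac{R(u)_{qu}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_q} h_q \right).
\end{aligned}
\tag{xxxii.}
\]

Now these coefficients can be found at once if $q$ be known. 
For example: 

$q = 1$, 

\[ -\frac{R(u)_{1u}}{R(u)_{uu}} = r_{1u}; \]

$q = 2$, 

\[ -\frac{R(u)_{1u}}{R(u)_{uu}} = \frac{r_{1u} - r_{2u} r_{12}}{1 - r_{12}^2}, \qquad -\frac{R(u)_{2u}}{R(u)_{uu}} = \frac{r_{2u} - r_{1u} r_{12}}{1 - r_{12}^2}; \]

$q = 3$, 

\[ -\frac{R(u)_{1u}}{R(u)_{uu}} = \frac{r_{1u} (1 - r_{23}^2) - r_{2u} (r_{12} - r_{13} r_{23}) - r_{3u} (r_{13} - r_{12} r_{23})}{1 - r_{12}^2 - r_{23}^2 - r_{13}^2 + 2 r_{12} r_{23} r_{13}}, \]

and $-\dfrac{R(u)_{2u}}{R(u)_{uu}}$ and $-\dfrac{R(u)_{3u}}{R(u)_{uu}}$ can be written down by symmetry. 

(xxxiii.) 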
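
The regression coefficients of the $u$th organ on the $q$ selected organs follow mechanically from the bordered determinant. The sketch below is an added illustration: `regression_weights`, `det` and `cofactor` are hypothetical names, the minors are signed, and the correlations are arbitrary assumed values. It forms the negative ratio of each border minor to the corner minor for any $q$ and may be compared with the special cases $q = 1, 2, 3$.

```python
def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if not m:
        return 1.0
    return sum((-1) ** j * a * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j, a in enumerate(m[0]))


def cofactor(m, p, q):
    """Signed minor of the constituent in row p, column q (0-based indices)."""
    sub = [row[:q] + row[q + 1:] for i, row in enumerate(m) if i != p]
    return (-1) ** (p + q) * det(sub)


def regression_weights(r_sel, r_u):
    """Return -R(u)_pu / R(u)_uu for p = 1 ... q.

    r_sel: correlations of the q selected organs among themselves.
    r_u:   correlations of each selected organ with the u-th organ.
    """
    q = len(r_sel)
    big = [r_sel[i] + [r_u[i]] for i in range(q)] + [r_u + [1.0]]  # R(u)
    return [-cofactor(big, p, q) / cofactor(big, q, q) for p in range(q)]


# Illustrative correlations (assumed values).
r12, r13, r23 = 0.5, 0.3, 0.4
r1u, r2u, r3u = 0.2, 0.1, 0.6

w1 = regression_weights([[1.0]], [r1u])
w2 = regression_weights([[1.0, r12], [r12, 1.0]], [r1u, r2u])
w3 = regression_weights([[1.0, r12, r13], [r12, 1.0, r23], [r13, r23, 1.0]],
                        [r1u, r2u, r3u])
```

Multiplying the $p$th weight by $\sigma_u h_p/\sigma_p$ and summing over $p$ gives the mean deviation of the $u$th organ for the array.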
Now suppose in the expression (xxix.) bis we were to put $x_1, x_2, \ldots x_q$ equal to 
$h_1, h_2, \ldots h_q$ respectively, then the system of equations to find the means of 
$x_{q+1}, x_{q+2}, \ldots x_u, \ldots x_n$ for this array would be the $n - q$ equations 

\[
\begin{aligned}
c_{1,q+1} h_1 + c_{2,q+1} h_2 + \ldots + c_{q,q+1} h_q + c_{q+1,q+1} \bar{x}_{q+1} + c_{q+2,q+1} \bar{x}_{q+2} + \ldots + c_{n,q+1} \bar{x}_n &= 0, \\
c_{1,q+2} h_1 + c_{2,q+2} h_2 + \ldots + c_{q,q+2} h_q + c_{q+1,q+2} \bar{x}_{q+1} + c_{q+2,q+2} \bar{x}_{q+2} + \ldots + c_{n,q+2} \bar{x}_n &= 0, \\
\ldots \qquad & \\
c_{1n} h_1 + c_{2n} h_2 + \ldots + c_{qn} h_q + c_{q+1,n} \bar{x}_{q+1} + c_{q+2,n} \bar{x}_{q+2} + \ldots + c_{nn} \bar{x}_n &= 0,
\end{aligned}
\tag{xxxiv.}
\]

where $\bar{x}_{q+1}, \bar{x}_{q+2}, \ldots \bar{x}_n$ are the "co-ordinates of the centre" of the array of the 
$n - q$ organs. 

If we were to solve these equations we ought to get precisely the solution for 
$\bar{x}_u$ ($u > q$ and $< n + 1$) that we have found for $\bar{x}_u$ in (xxxii.) above, where none of the 
coefficients involve correlation-coefficients other than those of the first $q$ organs 
among themselves and with the $u$th organ. This result flows from the pretty obvious 
law that the mean of the $u$th organ for an array determined by values of the 
first $q$ organs cannot be in any way dependent on our considering the relation of this 
selection of $q$ organs to any additional organs beside the $u$th: see p. 9. 

Thus the solution of (xxxiv.) is simply obtained by putting $u = q + 1$, $q + 2$, 
$q + 3$, ... $n$ successively in (xxxiii.). 

Let us now select the first $q$ organs not with absolute values, but varying about 
means $h_1, h_2, \ldots h_q$, with standard deviations $s_1, s_2, \ldots s_q$, and with mutual 
correlations $\rho_{12}, \rho_{13}, \ldots \rho_{1q}, \rho_{23}, \ldots \rho_{2q}, \ldots \rho_{q-1,q}$. We have then to multiply $z$ in (xxix.) bis 
by an exponential quadratic function of the deviations of $x_1, x_2, \ldots x_q$ from the selected means $h_1, h_2, \ldots h_q$, i.e., the selective 
correlation surface, and divide it by another exponential quadratic surface, i.e., the 
primary correlation surface of the $q$ organs $x_1, x_2, \ldots x_q$. This follows, since the 
frequency of each complex of $n$ organs must be reduced in the ratio of the selected 
to the primary frequency of the complex of $q$ selected organs. But it will be clear 
that such a reduction must give us a result of the following form for the final 
frequency surface of the $n$ organs: 

\[
\begin{aligned}
Z = \text{constant} \times \exp -\tfrac{1}{2} \{ & \mathbf{c}_{11} x_1^2 + \mathbf{c}_{22} x_2^2 + \ldots + \mathbf{c}_{qq} x_q^2 \\
& + c_{q+1,q+1} x_{q+1}^2 + \ldots + c_{nn} x_n^2 \\
& + 2 \mathbf{c}_{v'v''} x_{v'} x_{v''} + \ldots \quad (v' \text{ and } v'' < q + 1) \\
& + 2 c_{vu} x_v x_u + \ldots \quad (v < q + 1 \text{ and } u > q) \\
& + 2 c_{u'u''} x_{u'} x_{u''} + \ldots \quad (u' \text{ and } u'' > q) \\
& + \text{linear terms in } x_{v'}, x_{v''} \quad (v' \text{ and } v'' < q + 1) \},
\end{aligned}
\tag{xxxv.}
\]

where the $\mathbf{c}$'s denote the changed $c$'s. 



SELECTION ON THE VARIABILITY AND CORRELATION OF ORGANS. 






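Now if we differentiate the quadric to find its "centre" we have $n$ equations in 
$x_1 \ldots x_q \ldots x_n$, but the solutions of these, if $x_{q+1} \ldots x_n$ were eliminated, are known 
to be the "centre" $h_1, h_2 \ldots h_q$. Hence we require only $n - q$ equations involving 
$x_{q+1} \ldots x_n$, and we can put the $h$'s for the remaining values $x_1 \ldots x_q$. Let us take 
the differentials of the quadric with regard to $x_{q+1} \ldots x_n$, then the resulting 
equations involve none of the changed constants $\mathbf{c}$, but only the original $c$'s. They reproduce in fact (xxxiv.). 
But the values of $\bar{x}_{q+1} \ldots \bar{x}_n$ found from (xxxiv.) are, we have seen, identical 
with the values of (xxxii.). Thus we have 

\[
\begin{aligned}
\bar{x}_{q+1} &= -\left( \frac{R(q+1)_{1,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_1} h_1 + \frac{R(q+1)_{2,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_2} h_2 + \ldots + \frac{R(q+1)_{q,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_q} h_q \right), \\
\bar{x}_{q+2} &= -\left( \frac{R(q+2)_{1,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_1} h_1 + \frac{R(q+2)_{2,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_2} h_2 + \ldots + \frac{R(q+2)_{q,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_q} h_q \right), \\
\ldots \qquad & \\
\bar{x}_n &= -\left( \frac{R(n)_{1n}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_1} h_1 + \frac{R(n)_{2n}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_2} h_2 + \ldots + \frac{R(n)_{qn}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_q} h_q \right).
\end{aligned}
\tag{xxxvi.}
\]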



These give the most general form of a theorem proved for a particular case in 
'Phil. Trans.,' A, vol. 187, p. 300, (ii.). If systems of q organs be selected with any 
arbitrary variations and correlations out of complexes of n organs, then the mean sizes 
of the remaining n — q organs have precisely the same values as if the selection of all 
the systems of q organs had been to one size and not varied about mean values. The 
arbitrary variations of the selected systems about these mean values, as well as the 
arbitrary correlations, have no influence on the mean changes of the n — q organs. 
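
The theorem may be illustrated in the simplest case of one selected and one unselected organ. The sketch below is an added illustration under explicit assumptions, not the author's computation: it builds the selected surface directly, by multiplying the primary two-organ surface by a normal selection curve for the first organ (mean `h`, standard deviation `s1`) and dividing by the primary curve of that organ, and then reads the means and the variance of the second organ from the resulting quadratic. The function name and all numerical values are hypothetical.

```python
def select_one_of_two(sigma1, sigma2, r, h, s1):
    """Means of both organs and variance of organ 2 after selecting organ 1.

    The exponent of the selected surface is -1/2 (a11 x1^2 + 2 a12 x1 x2
    + a22 x2^2) + (h / s1^2) x1 + constant.
    """
    k = 1.0 - r * r
    # Primary constants, with 1/sigma1^2 removed and 1/s1^2 supplied for organ 1.
    a11 = 1.0 / (sigma1 ** 2 * k) - 1.0 / sigma1 ** 2 + 1.0 / s1 ** 2
    a12 = -r / (sigma1 * sigma2 * k)
    a22 = 1.0 / (sigma2 ** 2 * k)
    d = a11 * a22 - a12 ** 2
    b1 = h / s1 ** 2                 # linear term in x1
    mean1 = a22 * b1 / d             # centre of the selected surface
    mean2 = -a12 * b1 / d
    var2 = a11 / d                   # variance of organ 2 after selection
    return mean1, mean2, var2


# Illustrative values (assumed): the same mean shift h with three different s1.
sigma1, sigma2, r, h = 2.0, 3.0, 0.6, 1.5
results = [select_one_of_two(sigma1, sigma2, r, h, s1) for s1 in (0.5, 1.0, 3.0)]
```

In each case the mean of the second organ should come out as $r \sigma_2 h / \sigma_1$, whatever the selected variability `s1`, while its variance does depend on `s1`.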
Returning to equation (xxxv.) we know that if the determinant 

\[
\Delta = \begin{vmatrix}
\mathbf{c}_{1,1} & \mathbf{c}_{1,2} & \mathbf{c}_{1,3} & \ldots & \mathbf{c}_{1,q} & c_{1,q+1} & \ldots & c_{1,n} \\
\mathbf{c}_{2,1} & \mathbf{c}_{2,2} & \mathbf{c}_{2,3} & \ldots & \mathbf{c}_{2,q} & c_{2,q+1} & \ldots & c_{2,n} \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
\mathbf{c}_{q,1} & \mathbf{c}_{q,2} & \mathbf{c}_{q,3} & \ldots & \mathbf{c}_{q,q} & c_{q,q+1} & \ldots & c_{q,n} \\
c_{q+1,1} & c_{q+1,2} & c_{q+1,3} & \ldots & c_{q+1,q} & c_{q+1,q+1} & \ldots & c_{q+1,n} \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
c_{n,1} & c_{n,2} & c_{n,3} & \ldots & c_{n,q} & c_{n,q+1} & \ldots & c_{n,n}
\end{vmatrix}
\tag{xxxvii.}
\]

be formed, its constituents and not the linear terms in the exponential of (xxxv.) 
determine all the standard deviations and correlations. Let $\Sigma_u$ be the variation 
after selection of the $u$th organ; then if $u$ be one of the selected organs $\Sigma_u = s_u$; if $u$ 
be for one of the unselected organs $\Sigma_u$ has still to be found. Let $X_{uv}$ be the correlation 
coefficient of the $u$th and $v$th organs after selection, then if $u$ and $v$ be selected organs, 
$X_{uv} = \rho_{uv}$; if they be not both selected organs, then it has to be found. Let $D_{uv}$ be 
the minor corresponding to the constituent $c_{uv}$ in the above determinant, then by 
(x.) and (xiv.) 

\[ \Sigma_u^2 = \frac{D_{uu}}{\Delta}, \tag{xxxviii.} \]

and 

\[ \Sigma_u \Sigma_v X_{uv} = \frac{D_{uv}}{\Delta}. \tag{xxxix.} \]

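Now let us write for brevity $\alpha_{uu} = D_{uu}/\Delta$ and $\alpha_{uv} = D_{uv}/\Delta$. Clearly as long as $u$ and $v$ 
are less than $q + 1$, $\alpha_{uu}$ and $\alpha_{uv}$ will both be known, i.e., are equal to $s_u^2$ and $s_u s_v \rho_{uv}$. 
Now by a well-known property of determinants 

\[
\begin{aligned}
c_{1,q+1} \alpha_{11} + c_{2,q+1} \alpha_{12} + c_{3,q+1} \alpha_{13} + \ldots + c_{q,q+1} \alpha_{1q} + c_{q+1,q+1} \alpha_{1,q+1} + \ldots + c_{n,q+1} \alpha_{1n} &= 0, \\
c_{1,q+2} \alpha_{11} + c_{2,q+2} \alpha_{12} + c_{3,q+2} \alpha_{13} + \ldots + c_{q,q+2} \alpha_{1q} + c_{q+1,q+2} \alpha_{1,q+1} + \ldots + c_{n,q+2} \alpha_{1n} &= 0, \\
\ldots \qquad & \\
c_{1,n} \alpha_{11} + c_{2,n} \alpha_{12} + c_{3,n} \alpha_{13} + \ldots + c_{q,n} \alpha_{1q} + c_{q+1,n} \alpha_{1,q+1} + \ldots + c_{n,n} \alpha_{1n} &= 0.
\end{aligned}
\tag{xl.}
\]

Comparing these equations for the $n - q$ unknowns $\alpha_{1,q+1}, \alpha_{1,q+2} \ldots \alpha_{1n}$ with (xxxiv.) 
for finding $\bar{x}_{q+1}, \bar{x}_{q+2} \ldots \bar{x}_n$, we see that they are absolutely identical if we change 
$h_1, h_2 \ldots h_q$ in the latter into $\alpha_{11}, \alpha_{12} \ldots \alpha_{1q}$. Accordingly the solution is given by 
(xxxvi.), or we have 

\[
\begin{aligned}
\alpha_{1,q+1} &= -\left( \frac{R(q+1)_{1,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_1} \alpha_{11} + \frac{R(q+1)_{2,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_2} \alpha_{12} + \ldots + \frac{R(q+1)_{q,q+1}}{R(q+1)_{q+1,q+1}} \frac{\sigma_{q+1}}{\sigma_q} \alpha_{1q} \right), \\
\alpha_{1,q+2} &= -\left( \frac{R(q+2)_{1,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_1} \alpha_{11} + \frac{R(q+2)_{2,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_2} \alpha_{12} + \ldots + \frac{R(q+2)_{q,q+2}}{R(q+2)_{q+2,q+2}} \frac{\sigma_{q+2}}{\sigma_q} \alpha_{1q} \right), \\
\ldots \qquad & \\
\alpha_{1n} &= -\left( \frac{R(n)_{1n}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_1} \alpha_{11} + \frac{R(n)_{2n}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_2} \alpha_{12} + \ldots + \frac{R(n)_{qn}}{R(n)_{nn}} \frac{\sigma_n}{\sigma_q} \alpha_{1q} \right).
\end{aligned}
\]

Provided $v$ be $< q + 1$ we might equally well have used $\alpha_{v1}, \alpha_{v2}, \ldots \alpha_{vq}$. Hence 
we conclude that if $v$ be $< q + 1$ and $u$ be $> q$, 

\[ \alpha_{vu} = -\left( \frac{R(u)_{1u}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_1} \alpha_{v1} + \frac{R(u)_{2u}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_2} \alpha_{v2} + \ldots + \frac{R(u)_{qu}}{R(u)_{uu}} \frac{\sigma_u}{\sigma_q} \alpha_{vq} \right). \]

Or, substituting the known values of $\alpha_{v1}, \alpha_{v2}, \ldots \alpha_{vq}$, we have for $v < q + 1$ and 
$u > q$ 

\[ \alpha_{vu} = -\frac{s_v \sigma_u}{R(u)_{uu}} \left( R(u)_{1u} \frac{s_1 \rho_{v1}}{\sigma_1} + R(u)_{2u} \frac{s_2 \rho_{v2}}{\sigma_2} + \ldots + R(u)_{qu} \frac{s_q \rho_{vq}}{\sigma_q} \right), \tag{xli.} \]

where $\rho_{vv}$ is to be read as unity. 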

The next stage in our work is to find $\alpha_{vu}$ when both $v$ and $u$ are $> q$, and also $\alpha_{uu}$ 
when $v = u$ and both are $> q$. This is done at once by substituting the minors $\alpha_{u1}, 
\alpha_{u2}, \ldots \alpha_{uq} \ldots \alpha_{un}$ in the equations formed from the last $n - q$ lines of the determinant 
(xxxvii.). 

We obtain the following system: 

\[
\begin{aligned}
c_{1,q+1} \alpha_{u1} + c_{2,q+1} \alpha_{u2} + c_{3,q+1} \alpha_{u3} + \ldots + c_{q,q+1} \alpha_{uq} + c_{q+1,q+1} \alpha_{u,q+1} + \ldots + c_{n,q+1} \alpha_{un} &= 0, \\
c_{1,q+2} \alpha_{u1} + c_{2,q+2} \alpha_{u2} + c_{3,q+2} \alpha_{u3} + \ldots + c_{q,q+2} \alpha_{uq} + c_{q+1,q+2} \alpha_{u,q+1} + \ldots + c_{n,q+2} \alpha_{un} &= 0, \\
\ldots \qquad & \\
c_{1,u} \alpha_{u1} + c_{2,u} \alpha_{u2} + c_{3,u} \alpha_{u3} + \ldots + c_{q,u} \alpha_{uq} + c_{q+1,u} \alpha_{u,q+1} + \ldots + c_{n,u} \alpha_{un} &= 1, \\
\ldots \qquad &
\end{aligned}
\]

These are identical with equations (xl.), except that the equation with the 
coefficients $c_{1u}, c_{2u}, \ldots c_{nu}$ has unity instead of zero on the right-hand side. Hence 
we see that $\alpha_{uv}$ ($v > q$) will be the same function of $\alpha_{u1}, \alpha_{u2}, \ldots \alpha_{uq}$ that $\bar{x}_v$ is of $h_1, 
h_2, \ldots h_q$ in equation (xxxvi.), but it will add to this a function of the last $n - q$ 
system of $c$'s, i.e., the $c$'s 

\[ c_{q+1,q+1},\ c_{q+1,q+2},\ \ldots\ c_{n,n}. \]

Whatever this function may be we will represent it for the time by $\gamma_{uv}$; we notice 
that it is independent of the selected variations $s_1, s_2, \ldots s_q$, the selected means $h_1, 
h_2, \ldots h_q$, and the selected correlation coefficients $\rho_{12}, \rho_{23}, \ldots \rho_{q-1,q}$. It depends 
only on the characters before selection. 

We thus have 

\[ \alpha_{uv} = \gamma_{uv} - \left( \frac{R(v)_{1v}}{R(v)_{vv}} \frac{\sigma_v}{\sigma_1} \alpha_{u1} + \frac{R(v)_{2v}}{R(v)_{vv}} \frac{\sigma_v}{\sigma_2} \alpha_{u2} + \ldots + \frac{R(v)_{qv}}{R(v)_{vv}} \frac{\sigma_v}{\sigma_q} \alpha_{uq} \right). \]

Now the system $\alpha_{u1}, \alpha_{u2}, \ldots \alpha_{uq}$ can be found from (xli.), since $\alpha_{u'u''} = \alpha_{u''u'}$ whatever 
$u'$ and $u''$ be. 

Hence we have 

\[
\alpha_{uv} = \gamma_{uv} + \sigma_v \sigma_u \left\{ S_1\!\left( \frac{R(v)_{pv}\, R(u)_{pu}}{R(v)_{vv}\, R(u)_{uu}} \frac{s_p^2}{\sigma_p^2} \right) + S_2\!\left( \frac{R(v)_{p'v} R(u)_{p''u} + R(v)_{p''v} R(u)_{p'u}}{R(v)_{vv}\, R(u)_{uu}} \frac{s_{p'} s_{p''} \rho_{p'p''}}{\sigma_{p'} \sigma_{p''}} \right) \right\}.
\]

Here $S_1$ denotes a sum from $p = 1$ to $p = q$, and $S_2$ a sum for every pair of values 
of $p'$ and $p''$ out of 1, 2, 3, ... $q$. 
When $u = v$ we have simply 

\[
\alpha_{vv} = \gamma_{vv} + \sigma_v^2 \left\{ S_1\!\left( \left( \frac{R(v)_{pv}}{R(v)_{vv}} \right)^2 \frac{s_p^2}{\sigma_p^2} \right) + 2 S_2\!\left( \frac{R(v)_{p'v}\, R(v)_{p''v}}{(R(v)_{vv})^2} \frac{s_{p'} s_{p''} \rho_{p'p''}}{\sigma_{p'} \sigma_{p''}} \right) \right\}.
\]

It only remains to determine $\gamma_{uv}$ and $\gamma_{vv}$. This we can do by putting all the $s$'s 
zero, or selecting our $q$ organs of one size only. We see at once that $\gamma_{uv}$ and $\gamma_{vv}$ are 
the values of $\alpha_{uv}$ and $\alpha_{vv}$, that is, of $\Sigma_u \Sigma_v X_{uv}$ and $\Sigma_v^2$, when we select $q$ organs of 
definite values and seek the correlation and the variabilities of two others, the $u$th and 
the $v$th. These values have already been found on p. 10. Or: 

\[
\begin{aligned}
\Sigma_u \Sigma_v X_{uv} = \gamma_{uv} &= -\sigma_u \sigma_v\, R(uv)_{uv} / R(q), \\
\Sigma_u^2 = \gamma_{uu} &= \sigma_u^2\, R(uv)_{vv} / R(q), \\
\Sigma_v^2 = \gamma_{vv} &= \sigma_v^2\, R(uv)_{uu} / R(q).
\end{aligned}
\tag{xlii.}
\]

The notation of that page has been changed so that $R(uv)$ now stands for the 
determinant 

\[
R(uv) = \begin{vmatrix}
1 & r_{12} & r_{13} & \ldots & r_{1q} & r_{1u} & r_{1v} \\
r_{21} & 1 & r_{23} & \ldots & r_{2q} & r_{2u} & r_{2v} \\
r_{31} & r_{32} & 1 & \ldots & r_{3q} & r_{3u} & r_{3v} \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
r_{q1} & r_{q2} & r_{q3} & \ldots & 1 & r_{qu} & r_{qv} \\
r_{u1} & r_{u2} & r_{u3} & \ldots & r_{uq} & 1 & r_{uv} \\
r_{v1} & r_{v2} & r_{v3} & \ldots & r_{vq} & r_{vu} & 1
\end{vmatrix}
\tag{xliii.}
\]

$R(uv)_{uv}$ is the minor corresponding to the constituent $r_{uv}$; $R(uv)_{uu}$ the minor 
corresponding to the constituent at the meet of the $u$th column and $u$th row; and 
$R(q)$ the determinant with the last two rows and two columns struck out. 

For example:

$q = 1$:

$$\gamma_{uu} = \sigma_u^2(1 - r_{1u}^2), \qquad \gamma_{vv} = \sigma_v^2(1 - r_{1v}^2), \qquad \gamma_{uv} = \sigma_u\sigma_v(r_{uv} - r_{1u}r_{1v}).$$

$q = 2$:

$$\gamma_{uu} = \sigma_u^2(1 - r_{12}^2 - r_{1u}^2 - r_{2u}^2 + 2r_{12}r_{1u}r_{2u})/(1 - r_{12}^2),$$
$$\gamma_{vv} = \sigma_v^2(1 - r_{12}^2 - r_{1v}^2 - r_{2v}^2 + 2r_{12}r_{1v}r_{2v})/(1 - r_{12}^2),$$
$$\gamma_{uv} = \sigma_u\sigma_v\{r_{uv}(1 - r_{12}^2) - r_{1u}r_{1v} - r_{2u}r_{2v} + r_{12}(r_{1u}r_{2v} + r_{2u}r_{1v})\}/(1 - r_{12}^2).$$

$q = 3$:

$$\gamma_{uu} = \sigma_u^2\begin{vmatrix}1 & r_{12} & r_{13} & r_{1u}\\ r_{21} & 1 & r_{23} & r_{2u}\\ r_{31} & r_{32} & 1 & r_{3u}\\ r_{u1} & r_{u2} & r_{u3} & 1\end{vmatrix} \div (1 - r_{12}^2 - r_{23}^2 - r_{31}^2 + 2r_{12}r_{23}r_{31}),$$

$$\gamma_{vv} = \sigma_v^2\begin{vmatrix}1 & r_{12} & r_{13} & r_{1v}\\ r_{21} & 1 & r_{23} & r_{2v}\\ r_{31} & r_{32} & 1 & r_{3v}\\ r_{v1} & r_{v2} & r_{v3} & 1\end{vmatrix} \div (1 - r_{12}^2 - r_{23}^2 - r_{31}^2 + 2r_{12}r_{23}r_{31}),$$

$$\gamma_{uv} = \sigma_u\sigma_v\begin{vmatrix}1 & r_{12} & r_{13} & r_{1u}\\ r_{21} & 1 & r_{23} & r_{2u}\\ r_{31} & r_{32} & 1 & r_{3u}\\ r_{v1} & r_{v2} & r_{v3} & r_{vu}\end{vmatrix} \div (1 - r_{12}^2 - r_{23}^2 - r_{31}^2 + 2r_{12}r_{23}r_{31}) \;{}^{*} \qquad \text{(xliv.).}$$

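We can now collect our complete results.

The variability of a non-selected organ $v > q$ is after the selection of $q$ organs
given by

$$\Sigma_v^2 = \sigma_v^2\left[\frac{R(v)}{R(v)_{vv}} + S_1\left\{\left(\frac{R(v)_{pv}}{R(v)_{vv}}\right)^2\frac{s_p^2}{\sigma_p^2}\right\} + 2S_2\left\{\frac{R(v)_{p'v}R(v)_{p''v}}{(R(v)_{vv})^2}\,\frac{s_{p'}s_{p''}}{\sigma_{p'}\sigma_{p''}}\,\rho_{p'p''}\right\}\right] \qquad \text{(xlv.).}$$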

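The correlation of two non-selected organs $v$ and $u$ both $> q$ is after the selection
of $q$ organs given by

$$\frac{\Sigma_u\Sigma_vR_{uv}}{\sigma_u\sigma_v} = \frac{R(uv)_{uv}}{R(q)} + S_1\left\{\frac{R(v)_{pv}R(u)_{pu}}{R(v)_{vv}R(u)_{uu}}\left(\frac{s_p}{\sigma_p}\right)^2\right\} + S_2\left\{\frac{R(v)_{p'v}R(u)_{p''u} + R(v)_{p''v}R(u)_{p'u}}{R(v)_{vv}R(u)_{uu}}\,\frac{s_{p'}s_{p''}}{\sigma_{p'}\sigma_{p''}}\,\rho_{p'p''}\right\} \qquad \text{(xlvi.).}$$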

The correlation between a non-selected organ $u > q$ and a selected organ $v < q + 1$
is given by (xli.) bis or,

$$\Sigma_u s_v R_{uv} = -\,s_v\sigma_u\,S_1\left\{\frac{R(u)_{pu}}{R(u)_{uu}}\,\frac{s_p}{\sigma_p}\,\rho_{pv}\right\} \qquad \text{(xlvii.).}$$

* The expanded values of these determinants are given, 'Phil. Trans.,' A, vol. 187, p. 294.

Here in $S_1$, $p$ takes every value from 1 to $q$, and in $S_2$, $p'$ and $p''$ every possible pair
of values from 1 to $q$. Equations (xlv.)-(xlvii.) fully determine all the required
quantities and form the full solution of the problem of selection. Before we see the
remarkably simple forms they take for simpler cases, we may draw some general
conclusions of a most important character.
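The general result (xlv.)-(xlvii.) can be checked numerically. The following is a minimal sketch only, not the memoir's own method of working: it assumes normal correlation and restates the result in covariance-matrix form, obtaining the partial regression coefficients by inverting the covariance matrix of the selected organs instead of through the determinant minors. All function names and the example figures are hypothetical.

```python
def mat_mul(a, b):
    """Product of two small matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def mat_inv(a):
    """Invert a small square matrix by Gauss-Jordan elimination with pivoting."""
    n = len(a)
    aug = [[float(x) for x in a[i]] + [1.0 if i == j else 0.0 for j in range(n)]
           for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular matrix")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def covariance_after_selection(cov, new_sel_cov):
    """cov: covariance matrix before selection, selected organs listed first.
    new_sel_cov: q x q covariance matrix imposed on the selected organs.
    Returns the full covariance matrix after selection (normal correlation assumed)."""
    q, n = len(new_sel_cov), len(cov)
    s_aa = [row[:q] for row in cov[:q]]
    s_ba = [row[:q] for row in cov[q:]]
    s_bb = [row[q:] for row in cov[q:]]
    b = mat_mul(s_ba, mat_inv(s_aa))       # partial regressions on the selected organs
    cross = mat_mul(b, new_sel_cov)        # covariances non-selected x selected, after
    diff = [[new_sel_cov[i][j] - s_aa[i][j] for j in range(q)] for i in range(q)]
    adj = mat_mul(mat_mul(b, diff), transpose(b))
    new_bb = [[s_bb[i][j] + adj[i][j] for j in range(n - q)] for i in range(n - q)]
    top = [list(new_sel_cov[i]) + [cross[j][i] for j in range(n - q)] for i in range(q)]
    bottom = [list(cross[i]) + new_bb[i] for i in range(n - q)]
    return top + bottom


def correlation(cov, i, j):
    return cov[i][j] / (cov[i][i] * cov[j][j]) ** 0.5


# Hypothetical example: three organs with unit variances, organ 1 selected so
# that its standard deviation is halved (variance 0.25).
before = [[1.0, 0.7, 0.7], [0.7, 1.0, 0.0], [0.7, 0.0, 1.0]]
after = covariance_after_selection(before, [[0.25]])
print(round(correlation(after, 0, 1), 4), round(correlation(after, 1, 2), 4))
```

With one selected organ the sketch reduces to the simple two- and three-organ forms worked out in the later sections; with several selected organs `new_sel_cov` carries the $s$'s and $\rho$'s of the selection.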

In the first place we must distinguish between directly selected and what we 
have termed non-selected organs. It would be better to term the latter indirectly 
selected organs. Suppose the recruiting sergeant were to pay attention only to 
stature and seek to form a regiment of men of about 5 feet 10 inches. He might 
have a real range of stature about 6 or 7 inches, but he would strive to get men 
of about this height from the population. We will suppose that he did not consider 
chest-breadth, head-length, foot-length, lungs or any other character. The distribu- 
tion of these " non-selected" characters in the regiment would not be the same as in 
the general population. Their means would have changed by (xxxvi.) and their 
variabilities and correlations be given by (xlv.)-(xlvii.). In other words, an indirect 
selection would have taken place. A selection by stature would change foot-length
and head-length and indeed every other correlated organ. Much the same result must
occur in natural selection. If it be advantageous for a species to have a certain group
of its organs of definite size, falling within a definite range, and related to each other
in a definite manner, then these changes cannot take place without modifying not only 
the size, but the variability and correlation of all the other organs correlated with these, 
although these organs themselves be not directly selected. Practically this means 
all the other organs, for so far one can hardly say with certainty that we have come 
across any two characters in an organism which are uncorrelated. Many of those 
investigated are highly correlated, all appear to have some correlation, even if it be 
very small or negative. 

We may therefore conclude as follows : — 

(a) The selection of any complex of characters or organs in an organism changes
all the other characters and organs not directly selected.

(b) If the change in the complex be continuous and progressive, the other
characters will continue to be modified until the change in them is so considerable
that selection begins to act directly upon them also.

(c) The changes noted here are not confined to the average value of a non- 
directly selected character and to its variability ; the correlations between non- directly 
selected characters and the correlations between directly and non-directly selected 
characters are also both changed. 

(d) If local races have been produced by selection from a common stock, it will
be impossible to look upon correlation as a criterion for species. Every selection will 
modify such correlation, and it has no greater fixity than either type value (mean) 
or variability (standard deviation). 

The whole of these statements will become more manifest as we apply our general
theorem to special cases, but we must note that if two organs were uncorrelated with
each other, it still might be possible by selecting a third, or a third and fourth, to 
produce correlation between them. Further, by selection of one or more organs, two 
non-directly selected organs can have their existing correlation increased, lessened or 
even changed in sign. 

(4.) A primary difficulty will of course arise in the case of natural selection. How 
are we to determine which are the directly and which are the indirectly selected 
organs? With artificial selection by man, we know which organs have been selected
fairly well; attention has been paid to colour, size, proportion of parts, &c. Even in
the case of the medical examination of the recruit, it is chest, lungs, heart, stature,
&c., which form the basis of the acceptance or rejection. If the head or foot be not
absolutely deformed, little if any attention is paid to them, so with hair-colour,
probably eye-colour, and a mass of other details. No doubt the direct medical
selection indirectly selects these, but we could roughly class the selected and non- 
selected organs or characters and investigate the changes in the correlations of the 
latter owing to the indirect selection. But how are we to form these classes in the 
case of natural selection ? 

The investigations may look difficult, and even from the standpoint of arithmetic 
appalling, but it seems to me that the differentiation of organs into directly and 
indirectly selected classes is the keynote to the problem of evolution by natural 
selection. 

Let us look at a simple case and see whether it will throw any light on the
problem of distinguishing between directly and indirectly selected organs. Suppose we
have two organs only, with means $m_1$, $m_2$, standard deviations $\sigma_1$, $\sigma_2$, correlation $r_{12}$,
and let the first be selected so as to have a mean value $m_1 + h_1$, and standard
deviation $s_1$. Let $\Sigma_2$ be the standard deviation of the second organ and $R_{12}$ the
correlation of the two organs after selection, and $m_2 + H_2$ the mean of the non-selected
organ.

Then by (xxxii.):

$$H_2 = r_{12}\frac{\sigma_2}{\sigma_1}h_1,$$

and it will be shown later (see p. 23) that

$$R_{12}\frac{\Sigma_2}{s_1} = r_{12}\frac{\sigma_2}{\sigma_1}.$$

In other words the regression coefficient of the non-selected organ on the selected
remains unchanged, while that of the selected organ on the non-selected will, as a
rule, be widely modified.

Further, let $Y_2$ be the mean value of the second organ before selection corresponding
to a value $K_1$ of the first; let $M_1$ and $M_2$ be the means of the organs after selection,
and $Y_2'$ be the mean value of the second organ after selection corresponding to the same
value $K_1$ of the first. Then the equation to the regression line before selection is

$$Y_2 - m_2 = r_{12}\frac{\sigma_2}{\sigma_1}(K_1 - m_1),$$

and after selection it is

$$Y_2' - M_2 = R_{12}\frac{\Sigma_2}{s_1}(K_1 - M_1).$$

But this is identically the same line as the regression line before selection, since
$M_1 = m_1 + h_1$, $M_2 = m_2 + r_{12}(\sigma_2/\sigma_1)h_1$ and $R_{12}\Sigma_2/s_1 = r_{12}\sigma_2/\sigma_1$. Hence
not only the slope (regression coefficient) of the line, but its position is identical, and
we have the following result:

If two local races have been evolved from a single stock by the selection in different
ways of one organ only, then the regression lines for the two races of any non-directly
selected organ on the directly selected organ will be the same in direction and position;
but the regression lines of the selected organ on any non-selected organ will differ for
the two races.*
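The invariance of the one regression line and the change of the other can be seen in a small numerical sketch. It assumes normal correlation and selection acting on organ 1 only; the function names and the figures in the example are hypothetical and are not taken from the memoir.

```python
import math


def select_first_organ(m1, m2, sd1, sd2, r12, h1, s1):
    """Constants of two organs after organ 1 is selected to mean m1 + h1 and
    standard deviation s1. Returns (mean1, mean2, sd1, sd2, correlation) after
    selection, assuming normal correlation."""
    slope = r12 * sd2 / sd1                      # regression of organ 2 on organ 1
    mean1 = m1 + h1
    mean2 = m2 + slope * h1                      # indirect change of the mean
    var2 = sd2 ** 2 * (1 - r12 ** 2) + slope ** 2 * s1 ** 2
    new_sd2 = math.sqrt(var2)
    new_r12 = slope * s1 / new_sd2               # correlation after selection
    return mean1, mean2, s1, new_sd2, new_r12


def regression_2_on_1(mean1, mean2, sd1, sd2, r):
    """Slope and intercept of the regression line of organ 2 on organ 1."""
    slope = r * sd2 / sd1
    return slope, mean2 - slope * mean1


def regression_1_on_2(mean1, mean2, sd1, sd2, r):
    """Slope and intercept of the regression line of organ 1 on organ 2."""
    slope = r * sd1 / sd2
    return slope, mean1 - slope * mean2


# Hypothetical race: means 170 and 45, s.d.s 6.5 and 2.5, correlation 0.7;
# organ 1 is selected to a mean 5 units higher with half the variability.
before = (170.0, 45.0, 6.5, 2.5, 0.7)
after = select_first_organ(*before, h1=5.0, s1=3.25)
print(regression_2_on_1(*before), regression_2_on_1(*after))   # same line
print(regression_1_on_2(*before), regression_1_on_2(*after))   # different lines
```

Comparing the two printed pairs for two races derived from one stock is the test proposed in the theorem: agreement points to the second organ being only indirectly selected.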

Of course the means, standard deviations and correlations, not only of the selected 
organ but of all the non-selected organs also, will probably have changed. It is only 
certain of the regression lines which remain unchanged and serve as a criterion to 
enable us to distinguish between directly and non-directly selected organs.

Of course the problem in Nature will not be as simple as this, for differentiation of 
the two local races may have arisen from the selection of more than one organ, or 
may have arisen from the selection of two different organs, but the illustration will, 
I think, indicate the nature of the investigation we are proposing. 

* The geometrical interpretation in this simple case that the regression line is unchanged is quite
obvious, and, indeed, may serve as a proof.

We can easily generalise our theorem by considering the form of the selection
surface given on p. 12. Any result obtained from (xxxv.) which does not involve any
of the $e$'s will be a result unaffected by the selection that has gone on. Now to
obtain a regression equation we put any number of the $x$'s equal to constants, to $h$'s
say, and find the "centre" of the quadric of the remaining $x$'s; the co-ordinates of
this centre, expressed in terms of the $h$'s, are the regression equations. Now it will be
clear that if we put all the selected $x$'s equal to $h$'s, the differentials of the quadric
with regard to the remaining or non-selected $x$'s can contain no $e$'s, or the coefficients
of the regression equations thus found will not be modified by selection.

Further, we might have given not only the selected organs, but any number of the
non-selected organs constant values, and the resulting regression equations would
involve only the $c$'s and not the $e$'s.

Hence we have the following general theorems:

(i.) If an organ has been modified only by indirect selection, then its partial
regression coefficients on any complex of other organs, however large or small, provided
it includes all the directly selected organs, will remain unchanged by the selection.

(ii.) The same organ in two different local races which have been derived from a
common stock by the selection of two complexes of organs, some of which may or may
not be common, will, if it has not been directly selected in either case, give the same
partial regression coefficients for any group of organs which includes the members of
both complexes and any number of non-directly selected organs besides.
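Theorem (i.) can be illustrated numerically for three organs of which only the first is directly selected. The sketch below assumes normal correlation; the covariance figures and function names are hypothetical. It compares the partial regression of organ 3 on organs 1 and 2 (a complex that includes the selected organ) with the simple regression of organ 3 on organ 2 alone (a complex that omits it).

```python
def after_selecting_organ_1(cov, s1_sq):
    """Covariance matrix after organ 1 (index 0) has its variance changed to
    s1_sq by direct selection; normal correlation is assumed."""
    n = len(cov)
    b = [cov[i][0] / cov[0][0] for i in range(n)]   # regressions on organ 1 (b[0] = 1)
    change = s1_sq - cov[0][0]
    return [[cov[i][j] + b[i] * b[j] * change for j in range(n)] for i in range(n)]


def partial_regression_3_on_12(cov):
    """Coefficients of organ 3 on organs 1 and 2, from the 2 x 2 normal equations."""
    a, b, c = cov[0][0], cov[0][1], cov[1][1]
    g1, g2 = cov[0][2], cov[1][2]
    det = a * c - b * b
    return (c * g1 - b * g2) / det, (a * g2 - b * g1) / det


def simple_regression_3_on_2(cov):
    """Regression coefficient of organ 3 on organ 2 alone."""
    return cov[1][2] / cov[1][1]


# Hypothetical stock: unit variances, r12 = r13 = 0.6, r23 = 0.8.
stock = [[1.0, 0.6, 0.6], [0.6, 1.0, 0.8], [0.6, 0.8, 1.0]]
race = after_selecting_organ_1(stock, 0.25)

print(partial_regression_3_on_12(stock), partial_regression_3_on_12(race))
print(simple_regression_3_on_2(stock), simple_regression_3_on_2(race))
```

The first pair of coefficients agrees before and after selection, while the simple regression on organ 2 alone is altered, which is case (a) of the difficulty discussed next.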

If the partial regression equations have changed coefficients, then we cannot at
once determine whether:

(a) We are dealing with a non-directly selected organ, and have not included all
the directly selected organs in the group upon which we are calculating the regression; or

(b) We are really dealing with a directly selected organ. In this case, we have
also certainly not included at least one directly selected organ in the regression
group.

Theoretically, however, (i.) and (ii.) suffice to find out which, if any, are the non- 
directly selected organs in the differentiation of local races. Practically, however, 
the number of organs and characters may be so great, and our ignorance of those 
probably selected so complete, that the arithmetic of determining so extensive a 
series of partial regression coefficients may be quite beyond our powers. Still, where 
the divergence between local races is not too great, and the source of the differentiation
not too obscure, it is probable that the above theorems will lead to results of
great interest.*

* Mr. L. Bramley-Moore has been working with this end in view at the long-bones in man. But
even here the direct selection of parts of the vertebral column (for which, at present, we have no correlation
values either among themselves or with the long-bones) and of the hand and foot, which
Dr. W. E. Macdonell has just shown are very highly correlated with the long-bones, may render
nugatory all attempt to ascertain which, if any, long-bone has been only indirectly, or, at any rate, least
directly selected.

Without laying too great weight on these theorems, I would still venture to
suggest that if the criterion of a species be the discovery of any numerical constant
or group of constants, which is the same for all local races, then these constants must
not be sought in the values of mean characters, degrees of variability, or of correlation,
but in a system of partial regression coefficients, and the discovery of these is therefore
of first class biological importance; it is the classification of the characters into directly
and non-directly selected groups, i.e., it is the discovery of the modus operandi of the
factors by which the differentiation has taken place. We are a long way from
solution yet, but we may venture, perhaps, to admit a faint glimmer of light in the
direction of what might seem the culminating problem of the mathematical method
as applied to evolution: the piecing together by quantitative analysis of the stages of
descent.

(5.) I will take now the application of the above results to simple cases ; but for the 
benefit of those who cannot easily follow the main principles of our investigation 
through the stages of determinant analysis, I will prove directly the proposition that : 
the selection of an organ A alters the mean and variability of a correlated organ B, 
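and also the correlation between A and B.

Let the correlation surface for the two organs be

$$z = \frac{N}{2\pi\sigma_1\sigma_2\sqrt{1 - r_{12}^2}}\exp\left\{-\frac{1}{2(1 - r_{12}^2)}\left(\frac{x_1^2}{\sigma_1^2} - \frac{2r_{12}x_1x_2}{\sigma_1\sigma_2} + \frac{x_2^2}{\sigma_2^2}\right)\right\},$$

where N is the number of individuals in the general population before selection, and
the subscripts 1 and 2 refer to the organs A and B respectively.

Let the distribution of the population after selection of the A organ be

$$z_1' = \frac{n}{\sqrt{2\pi}\,s_1}\,e^{-(x_1 - h_1)^2/(2s_1^2)},$$

where $N - n$ is the total destruction, $h_1$ the mean and $s_1$ the variability of the
population with regard to A after selection. Before selection this distribution was

$$z_1 = \frac{N}{\sqrt{2\pi}\,\sigma_1}\,e^{-x_1^2/(2\sigma_1^2)}.$$

Hence, the selection being random with regard to the array of B's corresponding to
any A, we have for the surface after selection

$$z' = z \times z_1'/z_1,$$

or each array must be altered in the ratio of the corresponding $z_1'$ to $z_1$.
This gives for the surface in full

$$z' = \frac{n}{2\pi s_1\sigma_2\sqrt{1 - r_{12}^2}}\exp\left[-\frac{1}{2}\left\{x_1^2\left(\frac{1}{s_1^2} + \frac{r_{12}^2}{\sigma_1^2(1 - r_{12}^2)}\right) - \frac{2r_{12}x_1x_2}{\sigma_1\sigma_2(1 - r_{12}^2)} + \frac{x_2^2}{\sigma_2^2(1 - r_{12}^2)} - \frac{2h_1x_1}{s_1^2} + \frac{h_1^2}{s_1^2}\right\}\right] \qquad \text{(xlviii.).}$$

Integrate this for every possible value of $x_1$ and we shall have the distribution, with centre $H_2$ and
standard deviation $\Sigma_2$, of $x_2$ or B after the selection of A. After some reductions we find for the frequency

$$z_2' = \frac{n}{\sqrt{2\pi}\,\Sigma_2}\,e^{-(x_2 - H_2)^2/(2\Sigma_2^2)},$$

where

$$\Sigma_2^2 = \sigma_2^2\left\{1 - r_{12}^2\left(1 - \frac{s_1^2}{\sigma_1^2}\right)\right\} \qquad \text{(xlix.),}$$

which gives the standard deviation of the indirectly selected organ, and the "centre"
of this organ is given by

$$H_2 = r_{12}\frac{\sigma_2}{\sigma_1}h_1 \qquad \text{(l.).}$$

These are special cases of our results (xlv.) and (xxxvi.) above respectively.

Further, returning to the correlation surface (xlviii.), the coefficient of correlation
$R_{12}$ is half the coefficient of $x_1x_2$, with its sign changed, divided by the square root of the
product of the coefficients of $x_1^2$ and $x_2^2$. Hence we find

$$R_{12} = \frac{r_{12}\,s_1/\sigma_1}{\sqrt{1 - r_{12}^2\left(1 - s_1^2/\sigma_1^2\right)}} \qquad \text{(li.).}$$

Let $r_{12} = \sin\phi_{12}$, $R_{12} = \sin\Phi_{12}$, then we have

$$\tan\Phi_{12} = \frac{s_1}{\sigma_1}\tan\phi_{12} \qquad \text{(lii.).}$$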

This shows us that $\Phi_{12}$ decreases with $s_1$, or that $R_{12}$ decreases with $s_1$, that is to say,
the more intense the selection the less is the correlation. This in broad terms
demonstrates the general principle that intensity of selection connotes a lessening of
correlation. It is this principle which very possibly accounts for the fact that the
more civilized races of man appear to be not only more variable but more highly
correlated than the less civilized, among whom the struggle for existence is more
intense. It may, perhaps, also account for the skeletons of women of the civilized
races having their parts more highly correlated than the parts of those of men.*
Lastly, it may well throw some light on the markedly plastic character of races 
which have been stringently selected with regard to one or a few organs only. 

* See Lee and Pearson, 'Roy. Soc. Proc.,' vol. 61, p. 354; and Lee, 'Phil. Trans.,' A, vol. 196, p. 231.

As an illustration, suppose that the correlation between femur and tibia were .7,
and let us investigate what would be the effect of reducing the variability of the tibia
by direct selection 50 per cent. We find at once on substituting $s_1/\sigma_1 = .5$ and
$r_{12} = .7$ in (li.) above that $R_{12} = .44$, or a reduction of about 37 per cent. This will,
perhaps, be sufficient to indicate what immense changes must be made in the
correlation of highly correlated organs whenever selection, artificial or natural, is
stringent. It is important to notice that the change in the size of the organ in no
way influences the change in the correlation between organs, if the distribution be
normal;* the change depends only on the stringency of the selection. Breeders who
select by the size of an organ only are in that case very likely to reduce the variability
of the organ in the selected group by far more than 50 per cent. Accordingly, it is
not to be wondered at if they, to a great extent, destroy the correlation between the
selected organ and other organs. This destruction would appear as a want of relationship
between parts, possibly as a want of fixity in type.
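The femur and tibia figures can be reproduced in a few lines. This is a minimal sketch under the memoir's assumption of normal correlation; the function names are hypothetical. It evaluates the after-selection correlation both from the direct form (li.) and from the tangent form (lii.), which should agree.

```python
import math


def correlation_after_selection(r12, mu):
    """Form (li.): correlation of A and B after the s.d. of A has been altered
    in the ratio mu = s1/sigma1."""
    return r12 * mu / math.sqrt(1 - r12 ** 2 * (1 - mu ** 2))


def correlation_via_tangent(r12, mu):
    """Form (lii.): r = sin(phi) before and R = sin(Phi) after selection,
    with tan(Phi) = mu * tan(phi)."""
    phi = math.asin(r12)
    return math.sin(math.atan(mu * math.tan(phi)))


r_before, mu = 0.7, 0.5
r_after = correlation_after_selection(r_before, mu)
reduction = 100 * (r_before - r_after) / r_before
print(round(r_after, 2), round(correlation_via_tangent(r_before, mu), 2), round(reduction))
```

Because the mean shift $h_1$ does not enter either function, the sketch also makes visible the remark that, for normal distributions, only the stringency $s_1/\sigma_1$ matters.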

* It will do so if the distribution be skew, see 'Phil. Trans.,' A, vol. 191, p. 231.

By means of equation (lii.) $R_{12}$ can easily be found from $r_{12}$ and the ratio $s_1/\sigma_1$. All we need
is a table of trigonometrical functions. We observe that $R_{12}$ is always of the same
sign and less than $r_{12}$. For many biological purposes the following graphical construction
gives quite sufficiently accurate results. Let CAB be a quadrant, say of
10 centims. radius, and take the point P on this quadrant distant PN = $10r_{12}$
from AB. Take QN = $(s_1/\sigma_1)$ PN, and let AQ meet the quadrant in R; then RM, the
distance of R from AB, = $10R_{12}$, and consequently determines $R_{12}$. If the figure be
drawn on decimal paper the determination of $R_{12}$ is peculiarly easy.

[Figure: Graphical method of finding correlation between organs A and B after selection has acted on A. Scale of $r_{12}$ and $R_{12}$.]

In the above example $r_{12}$ = PN = .7; $s_1/\sigma_1 = \tfrac{1}{2}$, and therefore QN = $\tfrac{1}{2}r_{12}$ = .35;
$R_{12}$ = RM = .44, as before. See p. 23.

Further, taking AP as our unit, AN$^2 = 1 - r_{12}^2$, and QN $= r_{12} \times s_1/\sigma_1$. Hence,
from (xlix.):

$$\Sigma_2^2 = \sigma_2^2(\mathrm{AN}^2 + \mathrm{QN}^2) = \sigma_2^2\,\mathrm{AQ}^2.$$

Therefore if A$\sigma_2$ in the diagram be taken equal to $\sigma_2$ and Q$\Sigma_2$ be drawn parallel
to R$\sigma_2$, we shall have A$\Sigma_2$ = $\Sigma_2$, or we can scale off the reduced variability.

Thus the diagram enables us to see at a glance the reduction in correlation and
variability.
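The graphical construction can be mimicked with coordinates. In this sketch (an illustration only, with hypothetical names) A is taken as the origin, AB as the horizontal axis and the quadrant as of unit radius, so that lengths read off are $r_{12}$ and $R_{12}$ themselves and not ten times them.

```python
import math


def graphical_construction(r12, mu):
    """Follow the construction with A as origin and a unit quadrant.
    P lies on the quadrant at height PN = r12; Q lies on PN with QN = mu * PN;
    AQ produced meets the quadrant in R. Returns (RM, AQ)."""
    an = math.sqrt(1 - r12 ** 2)     # N is the foot of the perpendicular from P
    qn = mu * r12                    # height of Q above AB
    aq = math.hypot(an, qn)          # length AQ
    rm = qn / aq                     # height of R, the point of the quadrant on AQ
    return rm, aq


rm, aq = graphical_construction(0.7, 0.5)
print(round(rm, 2))   # after-selection correlation read off the diagram
print(round(aq, 4))   # ratio of the reduced to the original s.d. of organ B
```

RM reproduces the value given by (li.), and AQ is the factor by which the variability of the non-selected organ is reduced, as the last step of the construction states.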

(6.) Let us now write down the results when an organ A is selected out of a group
of three organs, A, B, C, whose constants are marked by the subscripts 1, 2, 3,
respectively. Let $\mu_1 = s_1/\sigma_1$ and be represented, when required, by $\cos\chi_1$. Then
we find from (xlv.)-(xlvii.):

$$H_2 = r_{12}\frac{\sigma_2}{\sigma_1}h_1, \qquad H_3 = r_{13}\frac{\sigma_3}{\sigma_1}h_1 \qquad \text{(liii.),}$$

$$\Sigma_2 = \sigma_2\{1 - (1 - \mu_1^2)r_{12}^2\}^{1/2} = \sigma_2\{1 - \sin^2\chi_1\cos^2\theta_{12}\}^{1/2},$$
$$\Sigma_3 = \sigma_3\{1 - (1 - \mu_1^2)r_{13}^2\}^{1/2} = \sigma_3\{1 - \sin^2\chi_1\cos^2\theta_{13}\}^{1/2} \qquad \text{(liv.),}$$

$$R_{12} = \frac{\mu_1r_{12}}{\{1 - (1 - \mu_1^2)r_{12}^2\}^{1/2}} = \frac{\cos\chi_1\cos\theta_{12}}{\sqrt{1 - \sin^2\chi_1\cos^2\theta_{12}}},$$
$$R_{13} = \frac{\mu_1r_{13}}{\{1 - (1 - \mu_1^2)r_{13}^2\}^{1/2}} = \frac{\cos\chi_1\cos\theta_{13}}{\sqrt{1 - \sin^2\chi_1\cos^2\theta_{13}}} \qquad \text{(lv.),}$$

$$R_{23} = \frac{r_{23} - r_{12}r_{13}(1 - \mu_1^2)}{\sqrt{\{1 - (1 - \mu_1^2)r_{12}^2\}\{1 - (1 - \mu_1^2)r_{13}^2\}}} = \frac{\cos\theta_{23} - \sin^2\chi_1\cos\theta_{12}\cos\theta_{13}}{\sqrt{(1 - \sin^2\chi_1\cos^2\theta_{12})(1 - \sin^2\chi_1\cos^2\theta_{13})}} \qquad \text{(lvi.),}$$

where, as before, we write $r_{uv} = \cos\theta_{uv}$. Let us also write $R_{uv} = \cos\Theta_{uv}$, and

$$\sin\chi_1\cos\theta_{12} = \cos\alpha_{12}, \qquad \sin\chi_1\cos\theta_{13} = \cos\alpha_{13}.$$

Then we can replace the above results by

$$\Sigma_2 = \sigma_2\sin\alpha_{12}, \qquad \Sigma_3 = \sigma_3\sin\alpha_{13},$$
$$\cos\Theta_{12} = \cot\chi_1\cot\alpha_{12}, \qquad \cos\Theta_{13} = \cot\chi_1\cot\alpha_{13},$$
$$\cos\Theta_{23} = \frac{\cos\theta_{23} - \cos\alpha_{12}\cos\alpha_{13}}{\sin\alpha_{12}\sin\alpha_{13}} \qquad \text{(lvii.).}$$

These equations admit of easy interpretation by spherical geometry.

Let P be the pole of the great circle DEFG. Take DG = $\theta_{23}$, DE = $\theta_{12}$, GF = $\theta_{13}$.
Join P to E and F; let the small circle of radius $\chi_1$ round P meet PD, PE, PF,
and PG in d, e, f, g respectively. Draw the arcs De and Gf. Let the small circles
with centres D and G and radii De and Gf respectively meet in Q. Join DQ and GQ.
Then the quantities required are:

$$\Sigma_2/\sigma_2 = \sin\mathrm{DQ}, \qquad \Sigma_3/\sigma_3 = \sin\mathrm{GQ},$$
$$R_{12} = \cos\mathrm{DeE}, \qquad R_{13} = \cos\mathrm{FfG}, \qquad R_{23} = \cos\mathrm{DQG}.$$

For DE = $\theta_{12}$, Ee = $\tfrac{\pi}{2} - \chi_1$, and the angle DEe = $\tfrac{\pi}{2}$;
hence: cos De = cos $\theta_{12}$ cos Ee = sin $\chi_1$ cos $\theta_{12}$ = cos $\alpha_{12}$, or $\alpha_{12}$ = De;
similarly $\alpha_{13}$ = Gf.

Next, cos DeE = cot De tan eE = cot $\alpha_{12}$ cot $\chi_1$, or the angle DeE = $\Theta_{12}$; similarly the angle FfG = $\Theta_{13}$.

Lastly, from the triangle DQG: DQ = $\alpha_{12}$, QG = $\alpha_{13}$, and DG = $\theta_{23}$, but

$$\cos\mathrm{DG} = \cos\mathrm{DQ}\cos\mathrm{QG} + \sin\mathrm{DQ}\sin\mathrm{QG}\cos\mathrm{DQG},$$

or,

$$\cos\mathrm{DQG} = \frac{\cos\theta_{23} - \cos\alpha_{12}\cos\alpha_{13}}{\sin\alpha_{12}\sin\alpha_{13}} = \cos\Theta_{23}; \text{ or the angle DQG} = \Theta_{23}.$$
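As a check that the spherical relations express the same quantities as the algebraic forms, the sketch below computes the after-selection correlation of B and C in both ways and compares them. It is an illustration under the memoir's assumption of normal correlation; the function names and the figures used are hypothetical.

```python
import math


def r23_direct(r12, r13, r23, mu):
    """Algebraic form: correlation of organs 2 and 3 after organ 1 is selected
    with mu = s1/sigma1."""
    k = 1 - mu ** 2
    return (r23 - k * r12 * r13) / math.sqrt((1 - k * r12 ** 2) * (1 - k * r13 ** 2))


def r23_spherical(r12, r13, r23, mu):
    """Spherical form: mu = cos(chi), cos(alpha) = sin(chi) * r, and the angle
    DQG of the triangle with sides alpha12, alpha13, theta23 gives R23."""
    chi = math.acos(mu)
    a12 = math.acos(math.sin(chi) * r12)
    a13 = math.acos(math.sin(chi) * r13)
    return (r23 - math.cos(a12) * math.cos(a13)) / (math.sin(a12) * math.sin(a13))


def sd_ratio_and_r12(r12, mu):
    """sin(alpha12) is the ratio of the reduced to the original s.d. of organ 2;
    cot(chi) * cot(alpha12) is the correlation of organs 1 and 2 after selection."""
    chi = math.acos(mu)
    a12 = math.acos(math.sin(chi) * r12)
    return math.sin(a12), (1 / math.tan(chi)) / math.tan(a12)


print(r23_direct(0.7, 0.8, 0.5, 0.5), r23_spherical(0.7, 0.8, 0.5, 0.5))
print(sd_ratio_and_r12(0.7, 0.5))
```

The two routes agree to rounding error, so a solution of the triangles by instrument and a direct evaluation should differ only by the accuracy of the instrument.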



Thus all the relations can be expressed in terms of the sides and angles of a simple 
system of spherical triangles. For the degree of accuracy generally possible in 
biological and sociological investigations these triangles can be solved by a spherical 
trigonometer, such as that sold by Kreedl, of Prague.* The changes, however,
which $R_{23}$ undergoes for various values of $r_{12}$, $r_{13}$, $r_{23}$ are, indeed, far more difficult
to appreciate as a whole than those of $R_{12}$ or $R_{13}$. In order that they may be followed
easily, and in order to solve directly to a degree of approximation sufficient for many
practical purposes problems in the influence of selection on correlation, my assistant,
Dr. L. N. G. Filon, has kindly drawn up the tables which accompany this memoir.

* It will suffice fairly well for all but a few special values of $r_{12}$, $r_{23}$, $r_{31}$.

Of course to bring them within any reasonable compass we have had to limit the
values taken. In the first place we have considered only eleven grades of selective
stringency given by

$s_1/\sigma_1$ = 0, 1/10, 2/10, 3/10, 4/10, 5/10, 6/10, 7/10, 8/10, 9/10, 1,

the corresponding values of $R_{23}$ in the tables are entered as

$R_0$, $R_1$, $R_2$, $R_3$, $R_4$, $R_5$, $R_6$, $R_7$, $R_8$, $R_9$, $R_{10}$.

The tables are calculated for $r_{23}$ both positive and negative, but $r_{12}$ and $r_{13}$ are
always supposed positive. If $r_{12}$ and $r_{13}$ be both negative, then $R_{23}$ will be the same
as if they were both positive. If $r_{12}$ and $r_{13}$ be of opposite signs, then all we have
to do is to look out $R_{23}$ in the table in which $r_{23}$ has a sign the reverse of its actual
value, and having found the corresponding value of $R_{23}$, then change its sign to obtain
the actual coefficient of correlation after selection. This follows, if $r_{13}$ be the
negative coefficient, by writing:

$$R_{23} = \frac{r_{23} - r_{12}r_{13}\sin^2\chi_1}{\sqrt{\{1 - r_{12}^2\sin^2\chi_1\}\{1 - r_{13}^2\sin^2\chi_1\}}} = -\,\frac{(-r_{23}) - r_{12}(-r_{13})\sin^2\chi_1}{\sqrt{\{1 - r_{12}^2\sin^2\chi_1\}\{1 - (-r_{13})^2\sin^2\chi_1\}}}.$$

Lastly, it would clearly be very laborious to tabulate $R_{23}$ for a very great series of
values of $r_{12}$, $r_{13}$, $r_{23}$. Accordingly a selection had to be made of these coefficients of
correlation. They were given the values 0, .25, .5, .75, and 1. These may be spoken
of as zero, small, medium, large, and perfect correlations, and the ranges 0 to .25,
.25 to .5, .5 to .75, and .75 to 1, as the ranges of little, moderate, considerable, and
high correlation respectively. There would thus appear to be 15 combinations of values
for $r_{12}$, $r_{13}$; these are given in the key to the tables as (a), (b), (c), (d) . . . (m), (n), (p),
see p. 63. If these 15 values had to be combined with the 10 values (5 positive
and 5 negative) of $r_{23}$ and the 11 values of $s_1/\sigma_1$ we should have 1650 entries in our
tables. But this number is much reduced by the consideration that the expression
$1 - r_{23}^2 - r_{13}^2 - r_{12}^2 + 2r_{23}r_{13}r_{12}$ has for the real correlation of three characters to be
always positive. $R_{23}$ can also never be greater than unity. Accordingly all values of
$r_{23}$, $r_{13}$, $r_{12}$ which do not satisfy these conditions have been excluded from the tables;
they cannot arise in nature. A few impossible values of $R_{23}$ have been included in
the tables, but these are placed there solely for the purpose of finding by interpolation
values of $R_{23}$ which are less than unity. The following purely hypothetical illustrations
of formula (lvi.) and the tables will serve to indicate their use.
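The admissibility condition on three correlation coefficients is easily tested by machine. The sketch below is an illustration only: the function names are hypothetical, and the grid it builds merely mirrors the values 0, .25, .5, .75, 1 described above without claiming to reproduce the key on p. 63.

```python
from itertools import combinations_with_replacement


def three_way_determinant(r12, r13, r23):
    """The expression 1 - r23^2 - r13^2 - r12^2 + 2 r23 r13 r12."""
    return 1 - r12 ** 2 - r13 ** 2 - r23 ** 2 + 2 * r12 * r13 * r23


def is_admissible(r12, r13, r23):
    """True when the three coefficients can belong to a real correlation system."""
    in_range = all(abs(r) <= 1 for r in (r12, r13, r23))
    return in_range and three_way_determinant(r12, r13, r23) > 0


GRID = (0.0, 0.25, 0.5, 0.75, 1.0)

# Unordered pairs of values for r12 and r13, giving the 15 combinations.
pairs = list(combinations_with_replacement(GRID, 2))

# Signed values of r23 on the same grid (zero counted once here).
signed = sorted({s * g for g in GRID for s in (1, -1)})

admissible = [(a, b, c) for (a, b) in pairs for c in signed if is_admissible(a, b, c)]
print(len(pairs), len(admissible))
```

A combination rejected by `is_admissible` corresponds to a set of coefficients which, in the words of the text, cannot arise in nature.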

Illustration I. Suppose the correlation of tibia and femur with each other to be
.8, and of both with the stature to be .6. How would their correlation be altered if
the variation in stature were reduced by selection to half its present value?

Let $s_1/\sigma_1 = \mu_1$ as before, and suppose $R_{23} = R$; then let $\mu_1$, $r_{12}$, $r_{13}$, $r_{23}$ be the values
of the constants next below the required values occurring in the tables, and giving
$R_{23} = R$; let $R'$ be the true value of $R_{23}$, corresponding to the values $\mu_1 + \delta\mu_1$, $r_{12} + \delta r_{12}$,
$r_{13} + \delta r_{13}$, $r_{23} + \delta r_{23}$. Thus we have, as far as first differences:

$$R' = R + 10\,(\Delta_{\mu_1}R)\,\delta\mu_1 + 4\,\{(\Delta_{r_{12}}R)\,\delta r_{12} + (\Delta_{r_{13}}R)\,\delta r_{13} + (\Delta_{r_{23}}R)\,\delta r_{23}\}.$$

In our case $\mu_1 = \tfrac{2}{10}$, $r_{12} = .5$, $r_{13} = .5$, $r_{23} = .75$, $\delta\mu_1 = .05$, $\delta r_{12} = \delta r_{13} = .1$,
$\delta r_{23} = .05$. Further, we look up Table IV. (a), and the nearest case is (j) under $R_2$,
which gives $R = .67105$. We then see that $(\Delta_{r_{12}}R) + (\Delta_{r_{13}}R)$ = difference between
(j) and (m) cases $= -.21455$; $(\Delta_{r_{23}}R)$ = difference between result in IV. (a) and
V. (a) $= .32895$; and lastly $(\Delta_{\mu_1}R)$ = difference between $R_2$ and $R_3$ columns of (j)
row of Table IV. (a) $= .00535$. Thus we find:

$$R' = .671050 + .002675 - .04291 + .06578 = .6966.$$

The value by straightforward calculation of formula (lvi.) is .6981, the two results
giving substantially the same value .7. Thus we see that such a selection would
reduce the correlation of tibia and femur by 12.5 per cent.

Illustration II. Suppose the correlation of humerus and femur to be .5, and of
those with stature to be .7 and .8 respectively. How would the correlation of humerus
and femur be modified by a selection of stature given by $s_1/\sigma_1 = .5$?

In this case, $\mu_1 = .5$, $r_{12} = .5$, $r_{13} = .75$, $r_{23} = .5$, $\delta\mu_1 = 0$, $\delta r_{12} = .2$, $\delta r_{13} = .05$,
$\delta r_{23} = 0$. We turn to Table III. (a) and take out (k) under $R_5$, which gives us
$R = .3192$. We have $\Delta_{r_{12}}R = -.1841$ and $\Delta_{r_{13}}R = -.04185$, whence we find
$R' = .1636$, but the differences of the table are too great at this point for the
result to be very trustworthy.* Suppose we take $\mu_1$ and $r_{23}$ as before, but $r_{12} = .75$,
$r_{13} = .75$, and therefore $R$, to be found from (m), $= .1351$; then $\delta r_{12} = -.05$,
$\delta r_{13} = .05$ and $\Delta_{r_{12}}R = -.1841$ as before, $\Delta_{r_{13}}R = -.2995$. Hence we deduce
$R' = .1120$. The mean of these two values of $R'$ is .1377, and the true value
calculated from (lvi.) is $R' = .1395$. Taking .14 for the practical value, we see that
the correlation of humerus and femur has been reduced by this comparatively
moderate selection of stature upwards of 70 per cent.!

* Second differences ought to be used, and the process indicated is practically equivalent to using
them.

Illustration III. Suppose a case in which humerus and femur were not correlated,
but that both were correlated .7 with stature. What would be the effect of
selecting stature with the same intensity, i.e., $s_1/\sigma_1 = .5$?

Our best results from the tables will be to take $R_5$ (m) from Table I. (a), which
gives $R = -.7297$. We have then $r_{12} = r_{13} = .75$, hence $\delta r_{12} = -.05$, $\delta r_{13} = -.05$.
$\Delta_{r_{12}}R$ is to be found from (m) and (h) and $= -.3193 = \Delta_{r_{13}}R$, and

$$R' = -.7297 + 4\,\{(\Delta_{r_{12}}R)\,\delta r_{12} + (\Delta_{r_{13}}R)\,\delta r_{13}\} = -.7297 + .1277 = -.6020.$$

The actual value by formula is $-.5810$.

Now this again is a remarkable result; by selecting an organ correlated with
two others, neither of which is correlated with the other, we have produced a
considerable correlation, and what is more, one of a negative sign.

In other words, if humerus and femur were unrelated to each other, but were
related to stature, then a selection of stature would result in men of long femur
having a short humerus, and vice versa.
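The first-difference estimates of Illustrations II and III can be reproduced from the tabular figures quoted in the text. The sketch below is a hypothetical helper, not part of the memoir's tables: it assumes the grid spacings 1/10 for $s_1/\sigma_1$ and 1/4 for the correlations, which is what gives the factors 10 and 4 in the interpolation formula.

```python
def first_difference_estimate(base, corrections):
    """base: tabulated value R.
    corrections: one (tabular difference, increment, grid spacing) triple for
    each constant that differs from its tabulated value."""
    return base + sum(diff * inc / spacing for diff, inc, spacing in corrections)


R_STEP = 0.25   # spacing of the tabulated correlations

# Illustration II, starting from case (k) and then from case (m).
est_ii_k = first_difference_estimate(
    0.3192, [(-0.1841, 0.2, R_STEP), (-0.04185, 0.05, R_STEP)])
est_ii_m = first_difference_estimate(
    0.1351, [(-0.1841, -0.05, R_STEP), (-0.2995, 0.05, R_STEP)])
est_ii_mean = (est_ii_k + est_ii_m) / 2

# Illustration III, starting from case (m) of Table I. (a).
est_iii = first_difference_estimate(
    -0.7297, [(-0.3193, -0.05, R_STEP), (-0.3193, -0.05, R_STEP)])

print(round(est_ii_k, 4), round(est_ii_m, 4), round(est_ii_mean, 4), round(est_iii, 4))
```

Set beside the values .1395 and $-.5810$ obtained directly from the formula, these estimates show how far first differences alone can be trusted where the tabular differences are large.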

Illustration IV. Suppose the correlation between greatest length and breadth
of the skull to be .25, and between both and the auricular height to be .5. Now let a
stringent selection, $s_1/\sigma_1 = \tfrac{1}{10}$, of height take place. What modification will there
be of the length and breadth correlation?

Table II. (a), $R_1$, case (j) gives us at once the result

$$R_{23} = .0033.$$

In other words, the correlation between length and breadth would be sensibly
destroyed by such a selection. Thus correlation can be created or destroyed or
reversed by selection.
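The "straightforward calculation" referred to in the four illustrations can be sketched as follows. The function is a direct transcription of the three-organ formula under normal correlation; its name is hypothetical. One assumption should be noted: for Illustration I the ratio $s_1/\sigma_1$ is taken as .25, this being the value at which the formula returns the quoted .6981 and the value implied by the tabular entry 2/10 with $\delta\mu_1 = .05$.

```python
import math


def r23_after_selection(r12, r13, r23, mu):
    """Correlation of organs 2 and 3 after organ 1 has been selected so that
    its s.d. is altered in the ratio mu = s1/sigma1 (normal correlation assumed)."""
    k = 1.0 - mu ** 2
    return (r23 - k * r12 * r13) / math.sqrt((1 - k * r12 ** 2) * (1 - k * r13 ** 2))


# (r12, r13, r23, mu) for each illustration; mu = 0.25 in the first is the
# assumption described in the lead-in above.
ILLUSTRATIONS = {
    "I": (0.6, 0.6, 0.8, 0.25),
    "II": (0.7, 0.8, 0.5, 0.5),
    "III": (0.7, 0.7, 0.0, 0.5),
    "IV": (0.5, 0.5, 0.25, 0.1),
}

results = {name: r23_after_selection(*args) for name, args in ILLUSTRATIONS.items()}
for name, value in results.items():
    print(name, round(value, 4))
```

The four cases show in turn a correlation slightly lessened, greatly lessened, created with a negative sign out of none, and sensibly destroyed.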

The above illustrations, hypothetical though they may be, will suffice to indicate 
how entirely dependent correlation is upon selection. We must look upon coefficients 
of correlation, in fact, as just as much the outcome of selection as coefficients of 
variation, standard-deviations, or even the mean size of organs. No selection can 
take place, in the sense in which it has usually been understood to take place — i.e., 
by a change of mean and of variability, without at the same time the means, 
variabilities and the correlations of all correlated, but not directly selected, organs 
being varied. This is true whether the non-selected organs be initially correlated or 
not among themselves. We must always bear in mind this all-important funda- 
mental conception, that natural or artificial selection, or even random sampling, are 
in themselves active factors in the modification (i.e., creation, destruction, or reversal) 
of correlation. Thus not only is the impossibility of the constancy of correlation 
for local races obvious, but the primary importance of insuring that our samples are 
representative, and not accidentally selected samples, in all observations or experiments 
on heredity, homotyposis, or organic correlation becomes more and more manifest. 
We must not lay too much stress on two heredity constants — differing, for example, by 
more than the probable error of their difference— unless we are convinced, which 
practically it will be difficult to be, that all modification of correlation by unin- 
tentional and unmarked selection has really been avoided. 

(7.) Let us now take the next most simple case: if A, B, C, D be four mutually
correlated organs (in either the same or different individuals), and a selection take
place of A and B, to find the changes in the characters of the non-selected
organs.

Let subscripts 1, 2, 3, 4 mark the organs A, B, C, D respectively; let $h_1$, $h_2$,
$s_1$, $s_2$, $\rho_{12}$ be the constants which determine the selection of A and B; and let us
apply (xlv.) to (xlvii.). Here the coefficients will be obtained from the set for two
selected organs in (xxxiii.) and (xliv.). Hence we find:

(lxii.).
(lxiv.).



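Finally, for the change of means of the non-directly selected organs, we have:

$$M_3 - m_3 = \frac{r_{13} - r_{12}r_{23}}{1 - r_{12}^2}\,\frac{\sigma_3}{\sigma_1}\,h_1 + \frac{r_{23} - r_{12}r_{13}}{1 - r_{12}^2}\,\frac{\sigma_3}{\sigma_2}\,h_2 \qquad \text{(lxv.)},$$

$$M_4 - m_4 = \frac{r_{14} - r_{12}r_{24}}{1 - r_{12}^2}\,\frac{\sigma_4}{\sigma_1}\,h_1 + \frac{r_{24} - r_{12}r_{14}}{1 - r_{12}^2}\,\frac{\sigma_4}{\sigma_2}\,h_2 \qquad \text{(lxvi.)}.$$

If we write (lxv.) and (lxvi.) in the form—

$$\frac{M_3 - m_3}{\sigma_3} = \beta_{13}\frac{h_1}{\sigma_1} + \beta_{23}\frac{h_2}{\sigma_2}, \qquad \frac{M_4 - m_4}{\sigma_4} = \beta_{14}\frac{h_1}{\sigma_1} + \beta_{24}\frac{h_2}{\sigma_2} \qquad \text{(lxvii.)},$$

the $\beta$'s are the partial regression coefficients, and the whole solution can be expressed
in terms of them. Thus:

$$\Sigma_3^2 = \sigma_3^2\left\{1 - \beta_{13}r_{13} - \beta_{23}r_{23} + \beta_{13}^2\left(\frac{s_1}{\sigma_1}\right)^2 + \beta_{23}^2\left(\frac{s_2}{\sigma_2}\right)^2 + 2\rho_{12}\beta_{13}\beta_{23}\frac{s_1 s_2}{\sigma_1\sigma_2}\right\} \qquad \text{(lxviii.)},$$

$$\Sigma_3\Sigma_4\mathfrak{r}_{34} = \sigma_3\sigma_4\left\{r_{34} - \beta_{13}r_{14} - \beta_{23}r_{24} + \beta_{13}\beta_{14}\left(\frac{s_1}{\sigma_1}\right)^2 + \beta_{23}\beta_{24}\left(\frac{s_2}{\sigma_2}\right)^2 + \rho_{12}(\beta_{13}\beta_{24} + \beta_{23}\beta_{14})\frac{s_1 s_2}{\sigma_1\sigma_2}\right\} \qquad \text{(lxix.)},$$

$$s_1\Sigma_3\mathfrak{r}_{13} = s_1\sigma_3\left\{\beta_{13}\frac{s_1}{\sigma_1} + \rho_{12}\beta_{23}\frac{s_2}{\sigma_2}\right\} \qquad \text{(lxx.)}.$$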

Thus the whole series of results can be easily calculated, if the regression coefficients 
are first calculated. 
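The remark that everything follows once the regression coefficients are known can be illustrated by a short routine. This is only a sketch under stated assumptions: the function name and the dictionary layout are hypothetical, the $\beta$'s are taken in the standardised form $\beta_{13} = (r_{13} - r_{12}r_{23})/(1 - r_{12}^2)$, and the numbers fed in are the French and Aino long-bone constants used in the illustration of section (8). Only the means and the humerus figures are compared with the printed values here.

```python
import math

def select_two_organs(sigma, r, h1, h2, s1, s2, rho12):
    """Effect on organs 3 and 4 of selecting organs 1 and 2.

    sigma maps organ -> standard deviation before selection;
    r maps (i, j), i < j -> correlation before selection.
    """
    d = 1.0 - r[1, 2] ** 2
    mu1, mu2 = s1 / sigma[1], s2 / sigma[2]
    beta, shift, ratio2 = {}, {}, {}
    for k in (3, 4):
        b1 = (r[1, k] - r[1, 2] * r[2, k]) / d
        b2 = (r[2, k] - r[1, 2] * r[1, k]) / d
        beta[k] = (b1, b2)
        # change of mean of organ k
        shift[k] = sigma[k] * (b1 * h1 / sigma[1] + b2 * h2 / sigma[2])
        # squared ratio of selected to unselected standard deviation
        ratio2[k] = (1.0 - b1 * r[1, k] - b2 * r[2, k]
                     + (b1 * mu1) ** 2 + (b2 * mu2) ** 2
                     + 2.0 * rho12 * b1 * b2 * mu1 * mu2)
    corr = {}
    for k in (3, 4):
        b1, b2 = beta[k]
        root = math.sqrt(ratio2[k])
        # selected organ with non-selected organ
        corr[1, k] = (b1 * mu1 + rho12 * b2 * mu2) / root
        corr[2, k] = (b2 * mu2 + rho12 * b1 * mu1) / root
    (b13, b23), (b14, b24) = beta[3], beta[4]
    num = (r[3, 4] - b13 * r[1, 4] - b23 * r[2, 4]
           + b13 * b14 * mu1 ** 2 + b23 * b24 * mu2 ** 2
           + rho12 * (b13 * b24 + b23 * b14) * mu1 * mu2)
    corr[3, 4] = num / math.sqrt(ratio2[3] * ratio2[4])
    big_sigma = {k: sigma[k] * math.sqrt(ratio2[k]) for k in (3, 4)}
    return {"shift": shift, "Sigma": big_sigma, "corr": corr}

# French males: 1 femur, 2 tibia, 3 humerus, 4 radius.
sigma = {1: 2.37, 2: 1.80, 3: 1.54, 4: 1.17}
r = {(1, 2): 0.806, (1, 3): 0.842, (1, 4): 0.744,
     (2, 3): 0.860, (2, 4): 0.780, (3, 4): 0.845}
# Selection towards the Aino leg: shifts of mean, new SDs and new correlation.
res = select_two_organs(sigma, r, h1=-4.46, h2=-2.92, s1=1.90, s2=1.67, rho12=0.827)
print(round(33.01 + res["shift"][3], 2), round(24.39 + res["shift"][4], 2))
print(round(res["Sigma"][3], 2), round(res["corr"][1, 3], 3), round(res["corr"][2, 3], 3))
```

Working from unrounded regression coefficients, the routine agrees with the printed means and humerus constants to within a unit or two of the last printed figure.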

I may make some remarks upon these results. A formula equivalent to (lxviii.)
was first given by me in my memoir on "Heredity, Panmixia, and Regression"
('Phil. Trans.,' A, vol. 187, p. 303), and used for certain problems of inheritance, and
conclusions drawn from (lxix.) or (lxx.) have been cited or indicated in other
memoirs.

Some interesting results follow at once. If the selection be very stringent, $s_1/\sigma_1$
and $s_2/\sigma_2 = 0$ sensibly, then all correlation between a selected and non-selected organ
is destroyed. But

$$\mathfrak{r}_{34} = \frac{r_{34}(1 - r_{12}^2) - r_{13}r_{14} - r_{23}r_{24} + r_{12}(r_{13}r_{24} + r_{14}r_{23})}{\sqrt{1 - r_{12}^2 - r_{13}^2 - r_{23}^2 + 2r_{12}r_{13}r_{23}}\,\sqrt{1 - r_{12}^2 - r_{14}^2 - r_{24}^2 + 2r_{12}r_{14}r_{24}}}.$$



This is what I have termed a partial correlation coefficient — i.e., the correlation
between C and D when fixed values are given to A and B. So far as I am aware,
such coefficients were first directly used by Mr. G. U. Yule in certain economic
problems.* They are of very considerable interest, but for natural or artificial
selection are not quite so important as the generalised form (lxix.), for we generally
select about a mean value, and not absolutely at it.
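As a hedged numerical illustration of the partial correlation coefficient, the sketch below evaluates it for humerus and radius with femur and tibia held fixed, using the French long-bone correlations quoted in section (8). The function name and the dictionary keyed by organ pairs are assumptions made for this example only.

```python
import math

def partial_corr_34_given_12(r):
    """Correlation of organs 3 and 4 when organs 1 and 2 are held at fixed values."""
    num = (r[3, 4] * (1.0 - r[1, 2] ** 2)
           - r[1, 3] * r[1, 4] - r[2, 3] * r[2, 4]
           + r[1, 2] * (r[1, 3] * r[2, 4] + r[1, 4] * r[2, 3]))

    def det3(k):
        # determinant of the correlations among organs 1, 2 and k
        return (1.0 - r[1, 2] ** 2 - r[1, k] ** 2 - r[2, k] ** 2
                + 2.0 * r[1, 2] * r[1, k] * r[2, k])

    return num / math.sqrt(det3(3) * det3(4))

# French males: 1 femur, 2 tibia, 3 humerus, 4 radius.
french = {(1, 2): 0.806, (1, 3): 0.842, (1, 4): 0.744,
          (2, 3): 0.860, (2, 4): 0.780, (3, 4): 0.845}
p34 = partial_corr_34_given_12(french)
print(round(p34, 3))
```

On these figures the humerus-radius correlation of .845 drops to a little under .5 once the leg bones are fixed, which is the limiting case of (lxix.) for absolute selection.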

It will be noticed that the coefficient of correlation of two non-selected organs
differs from the corresponding partial correlation coefficient by terms of the square
order in $s/\sigma$, but the coefficient of correlation of a selected and non-selected organ
differs from zero by terms of the first order in $s/\sigma$. Hence, when selection is intense
($s/\sigma$ small), we may neglect the former as compared with the latter, and we have
thus the basis of a method of approximation very useful in some cases.

* 'Roy. Soc. Proc.,' vol. 60, pp. 185, 188; 'Economic Journal,' December, 1895, and December, 1896.

I have not yet succeeded in giving a geometrical interpretation to the above 
formulae, but have little doubt that it would be quite easy if the " spherical " 
geometry of four dimensioned space were more familiar to me. It will suffice to 
observe that it is easy to find cases in which the correlation of a non-directly 
selected organ with a directly selected organ, or with another of its own class, can 
be reduced, destroyed, increased, or reversed. In fact, all our previous warnings as 
to the caution necessary in avoiding unintentional selection in collecting material for 
testing correlation remain in force, and, indeed, are emphasised. 

The following illustrations will indicate the kind of problems which may be 
attacked by such formulae as the above :— 



(8). Illustration I. — Influence of a Selection of the Long Bones of the Leg on the 
Long Bones of the Arm, and on the Relation of the Leg to the Arm. 



A numerical example will throw light on the application of the above formulae, and 
effectively illustrate the manner in which a selection influences size, variation, and 
correlation. 

Consider the long bones femur, tibia, humerus, and radius, indicated by the
subscripts 1, 2, 3, 4 respectively, and let $m_1$, $m_2$, $m_3$, $m_4$ be the mean values in
centimetres. Then the following numerical values are given in a memoir by Miss
Alice Lee and myself:—*

                 French ♂.     Aino ♂.
  $m_1$            45.23         40.77
  $m_2$            36.81         33.89
  $m_3$            33.01         29.50
  $m_4$            24.39         21.55
  $\sigma_1$        2.37          1.90
  $\sigma_2$        1.80          1.67
  $\sigma_3$        1.54          1.34
  $\sigma_4$        1.17          1.06
  $r_{12}$          .806          .827
  $r_{13}$          .842          .858
  $r_{14}$          .744          .789
  $r_{23}$          .860          .745
  $r_{24}$          .780          .865
  $r_{34}$          .845          .776

* 'Roy. Soc. Proc.,' vol. 61, p. 343 et seq. The correlations are not worked out for exactly the same
lengths in the case of the two races, but the numbers will serve quite well for the purposes of illustration.
Now let us select from the French population a group having the same
characteristics of the long bones of the leg as the Aino population, and then compare
the characteristics of the arm bones of this selected group with those of the Aino
population.

Our selection is given by:

$$h_1 = 40.77 - 45.23 = -4.46, \qquad h_2 = 33.89 - 36.81 = -2.92,$$

$$s_1 = 1.90, \qquad s_2 = 1.67, \qquad \rho_{12} = .827.$$





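The following constants must now be determined arithmetically:

$$s_1/\sigma_1 = 1.90/2.37 = .802, \qquad s_2/\sigma_2 = 1.67/1.80 = .928,$$

$$\frac{r_{13} - r_{12}r_{23}}{1 - r_{12}^2} = .426, \qquad \frac{r_{23} - r_{12}r_{13}}{1 - r_{12}^2} = .518,$$

$$\frac{r_{14} - r_{12}r_{24}}{1 - r_{12}^2} = .329, \qquad \frac{r_{24} - r_{12}r_{14}}{1 - r_{12}^2} = .515,$$

$$\frac{1 - r_{12}^2 - r_{13}^2 - r_{23}^2 + 2r_{12}r_{13}r_{23}}{1 - r_{12}^2} = .196,$$

$$\frac{1 - r_{12}^2 - r_{14}^2 - r_{24}^2 + 2r_{12}r_{14}r_{24}}{1 - r_{12}^2} = .354,$$

$$\frac{r_{34}(1 - r_{12}^2) - r_{13}r_{14} - r_{23}r_{24} + r_{12}(r_{13}r_{24} + r_{14}r_{23})}{1 - r_{12}^2} = .125.$$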

If $x'_3$ and $x'_4$ be the mean humerus and radius of a femur-tibia selection from the
French population, we have from (lxv.)—

$$x'_3 = 33.01 + .277\,h_1 + .443\,h_2,$$
$$x'_4 = 24.39 + .162\,h_1 + .335\,h_2.$$

These would give the effect of selecting any femur and tibia defined by $h_1$ and $h_2$
from the mean values of the humerus and radius. For the particular selection
indicated above:

$$x'_3 = 30.48, \qquad x'_4 = 22.69,$$

both of which are about a centimetre in excess of the Aino population. By selecting,
therefore, from the French, a population with a mean leg like the Aino, we should
still find the average arm of this population some two centimetres greater in length
than the Aino. The variabilities $\Sigma_3$ and $\Sigma_4$ of humerus and radius for a population
selected from the French by femur and tibia are obtained from (lxiii.) and (lix.).
We have:



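$$\Sigma_3^2/\sigma_3^2 = .196 + .181\left(\frac{s_1}{\sigma_1}\right)^2 + .268\left(\frac{s_2}{\sigma_2}\right)^2 + 2\,\frac{s_1 s_2}{\sigma_1\sigma_2}\,\rho_{12} \times .221,$$

$$\Sigma_4^2/\sigma_4^2 = .354 + .108\left(\frac{s_1}{\sigma_1}\right)^2 + .265\left(\frac{s_2}{\sigma_2}\right)^2 + 2\,\frac{s_1 s_2}{\sigma_1\sigma_2}\,\rho_{12} \times .169.$$

These give for the particular case:

$$\Sigma_3 = 1.39, \qquad \Sigma_4 = 1.11.$$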



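Turning to the correlation of humerus and radius, we have by (lx.)

$$\mathfrak{r}_{34} = \frac{\sigma_3\sigma_4}{\Sigma_3\Sigma_4}\left\{.125 + .140\left(\frac{s_1}{\sigma_1}\right)^2 + .267\left(\frac{s_2}{\sigma_2}\right)^2 + .389\,\rho_{12}\frac{s_1 s_2}{\sigma_1\sigma_2}\right\},$$

giving for the particular case:

$$\mathfrak{r}_{34} = .799.$$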



It will thus be seen that if we selected from the French a group with the same
variabilities and correlation of femur and tibia as the Aino, the variabilities and
correlation of the humerus and radius of this group would not be very different from
those of the Aino. On the other hand, the correlations between upper and lower
members would be very significantly different.

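Generally we have by (lxi.) for selection from the French:

$$\mathfrak{r}_{13} = \frac{\sigma_3}{\Sigma_3}\left\{.426\,\frac{s_1}{\sigma_1} + .518\,\rho_{12}\frac{s_2}{\sigma_2}\right\}, \qquad \mathfrak{r}_{14} = \frac{\sigma_4}{\Sigma_4}\left\{.329\,\frac{s_1}{\sigma_1} + .515\,\rho_{12}\frac{s_2}{\sigma_2}\right\},$$

$$\mathfrak{r}_{23} = \frac{\sigma_3}{\Sigma_3}\left\{.518\,\frac{s_2}{\sigma_2} + .426\,\rho_{12}\frac{s_1}{\sigma_1}\right\}, \qquad \mathfrak{r}_{24} = \frac{\sigma_4}{\Sigma_4}\left\{.515\,\frac{s_2}{\sigma_2} + .329\,\rho_{12}\frac{s_1}{\sigma_1}\right\}.$$

These yield for our particular case:

$$\mathfrak{r}_{13} = .819, \qquad \mathfrak{r}_{14} = .694,$$
$$\mathfrak{r}_{23} = .845, \qquad \mathfrak{r}_{24} = .768.$$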



These are all smaller than the corresponding French values; the selection has
reduced the correlation, but the Aino population has in all the cases but $r_{23}$ a greater
correlation than the French. We must accordingly conclude that by a leg selection
from the French aimed at reproducing the proportions of the Aino leg, we should not
obtain an arm equivalent to the Aino arm. The divergences are indicated in the
accompanying table:—

                                        Selection from     Aino.     Unselected
                                          the French.                  French.
  Mean humerus                               30.48         29.50        33.01
  Mean radius                                22.69         21.55        24.39
  Variability of humerus                      1.39          1.34         1.54
  Variability of radius                       1.11          1.06         1.17
  Correlation of humerus and radius           .799          .776         .845
  Correlation of femur and humerus            .819          .858         .842
  Correlation of femur and radius             .694          .789         .744
  Correlation of tibia and humerus            .845          .745         .860
  Correlation of tibia and radius             .768          .865         .780

There is, of course, no special reason for supposing that the French and Aino 
differ merely by an evolution which has acted by selection of femur and tibia. We 
might have obtained a race out of the French more nearly akin to the Aino by a 
selection of femur and humerus, but the process would numerically be exactly 
similar. The particular illustration here chosen is taken merely as an instance, to 
indicate how the methods developed in this memoir enable us to ascertain with 
quantitative certainty how far racial differences may be due to the more or less 
stringent selection of a limited number of organs in the one race. 

If we consider that local races have been differentiated from a parent stock by 
the selection of the chief or more markedly divergent organs, then we have in 
processes such as that just illustrated a method of ascertaining, at least tentatively, 
whether two races are to be considered as merely local varieties, and further the 
particular organs through selection of which the differentiation has taken place. 



Illustration II. — Influence of a Selection of Femur and Humerus in Modifying Stature.

The following data have been calculated for me by Miss Alice Lee from Rollet's
measurements on the French:—*

* They have been undertaken with the view of determining more scientifically than appears to me
yet to have been done, the mean stature of a race from a measurement of the long bones found in burial
mounds, &c. Rollet's measurements are given in 'De la Mensuration des Os longs des Membres,'
Lyons, 1889. I hope shortly to publish a memoir on the subject. [The memoir in question was published
in 'Phil. Trans.,' A, vol. 192, pp. 169-244, 1898.]





                                                   ♂               ♀
  Mean stature, $m_1$                          166.26 cms.     154.02 cms.
  Variation of stature, $\sigma_1$               5.50 cms.       5.45 cms.
  Mean femur, $m_2$                             45.23 cms.      41.57 cms.
  Variation of femur, $\sigma_2$                 2.37 cms.       2.26 cms.
  Mean humerus, $m_3$                           33.01 cms.      29.77 cms.
  Variation of humerus, $\sigma_3$               1.54 cms.       1.53 cms.
  Correlation, femur and humerus, $r_{23}$       .842            .872
  Correlation, stature and humerus, $r_{13}$     .809            .771
  Correlation, stature and femur, $r_{12}$       .811            .805





Now let us select from among French males a group having the same variability,
correlation, and mean size of humerus and femur as French females, and let us ask
how this would alter the variability ($\Sigma_1$), mean size ($M_1$) of stature in French
males, and also the correlation between stature and humerus ($\mathfrak{r}_{13}$) and stature and
femur ($\mathfrak{r}_{12}$).

We have at once from the second column —

$$h_2 = -3.66, \qquad h_3 = -3.24, \qquad s_2 = 2.26, \qquad s_3 = 1.53, \qquad \rho_{23} = .872,$$

whence we find,

$$\frac{r_{12} - r_{23}r_{13}}{1 - r_{23}^2} = .447, \qquad \frac{r_{13} - r_{23}r_{12}}{1 - r_{23}^2} = .433.$$






From (lxv.) we deduce,

$$M_1 = 166.26 + 1.037\,h_2 + 1.546\,h_3.$$

This formula gives the stature of any group of males selected from the French,
and having their femur and humerus respectively $h_2$ and $h_3$ centims. longer than the
average.

For the special selection referred to above, $h_2 = -3.66$ and $h_3 = -3.24$, hence

$$M_1 = 166.26 - 3.79 - 5.01 = 157.46.$$

This example shows us that if we selected French men with the same femur and
humerus as French women, it would be the selection of the humerus which would
contribute mostly to the reduction of stature — a somewhat singular result. Further,
such a selected group of French men would be still some 3½ centims. taller than the
average of French women (instead of about 12¼ centims.). Probably had we
selected the tibia as well, the greater portion of this remaining advantage in height
would have disappeared.
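The coefficients 1.037 and 1.546 can be recovered from the tabulated French male constants. The sketch below is illustrative only: the function and variable names are hypothetical, and because it carries unrounded regression coefficients the third decimal may differ slightly from the printed figures.

```python
def stature_of_selected_group(h_femur, h_humerus):
    """Mean stature (cm) of French males whose femur and humerus means are shifted."""
    m_stature, s_stature = 166.26, 5.50
    s_femur, s_humerus = 2.37, 1.54
    r_sf, r_sh, r_fh = 0.811, 0.809, 0.842  # stature-femur, stature-humerus, femur-humerus
    d = 1.0 - r_fh ** 2
    # partial regression of stature on femur and on humerus, in centimetres per centimetre
    b_femur = (r_sf - r_fh * r_sh) / d * s_stature / s_femur
    b_humerus = (r_sh - r_fh * r_sf) / d * s_stature / s_humerus
    mean = m_stature + b_femur * h_femur + b_humerus * h_humerus
    return mean, b_femur, b_humerus

# Shift the male femur and humerus means to the female values.
mean, b_femur, b_humerus = stature_of_selected_group(41.57 - 45.23, 29.77 - 33.01)
print(round(mean, 2), round(b_femur, 3), round(b_humerus, 3))
```

The larger coefficient on the humerus is what makes the humerus contribute more than the femur to the reduction of stature, even though its shift is the smaller of the two.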
To find the variability in stature of the selected group we must use (lviii.). We
deduce:

$$\Sigma_1^2 = \sigma_1^2\left\{.287 + .200\left(\frac{s_2}{\sigma_2}\right)^2 + .187\left(\frac{s_3}{\sigma_3}\right)^2 + .387\,\frac{s_2 s_3}{\sigma_2\sigma_3}\,\rho_{23}\right\}.$$

In our particular case this gives:

$$\Sigma_1 = \sigma_1 \times .987 = 5.43.$$

The actual variability in stature of French women is measured by a standard
deviation of 5.45. Hence our selected group of men would be sensibly equally
variable with French women, as far as absolute variation is concerned.

Lastly, from (lxi.):

$$\mathfrak{r}_{12} = \frac{\sigma_1}{\Sigma_1}\left\{.447\,\frac{s_2}{\sigma_2} + .433\,\rho_{23}\frac{s_3}{\sigma_3}\right\},$$

$$\mathfrak{r}_{13} = \frac{\sigma_1}{\Sigma_1}\left\{.433\,\frac{s_3}{\sigma_3} + .447\,\rho_{23}\frac{s_2}{\sigma_2}\right\},$$

which give in our particular case:

$$\mathfrak{r}_{12} = \frac{\sigma_1}{\Sigma_1} \times .8011 = .811,$$

$$\mathfrak{r}_{13} = \frac{\sigma_1}{\Sigma_1} \times .8017 = .812.$$

Such a selection, therefore, would only increase insensibly the correlation
between stature and humerus, while leaving that between stature and femur
the same. The sensible reduction of correlation between stature and humerus (.809
to .771), which is found as we pass from male to female, does not arise when we
select a group of males with their femur and humerus of the same length, variation,
and correlation as those of the females.

Illustration III. — Influence of a Selection of Stature in Modifying Femur and Humerus.

Let us select a group of French men having the same height and variability in
height as French women, and calculate the changes which will arise in their femur
and humerus.

Here the selection is given by:

$$h_1 = -12.24, \qquad s_1 = 5.45.$$

We now need only the earlier formulae of this memoir. From (l.) we find

$$M_2 = m_2 + .349\,h_1, \qquad M_3 = m_3 + .227\,h_1.$$

These give for our case:

$$M_2 = 40.95, \qquad M_3 = 30.24.$$

Thus a group of males, selected to have the same stature as the females, would 
have a slightly shorter femur and a slightly longer humerus. A slightly longer femur 
in woman and a slightly longer humerus in man would thus appear to be sexual 
characters. 

Turning to the variations, these are given by (xlix.). We find : 

This shows us that while the selection would give the same variability of humerus 
to the men that women have, it would fail to produce the reduction of variability in 
the femur, which is characteristic of the women. 

From (li.) we deduce

$$\mathfrak{r}_{12} = .808, \qquad \mathfrak{r}_{13} = .806,$$

while from (lvi.) we have

$$\mathfrak{r}_{23} = .840.$$

Thus we see that very small changes would be made in the correlations, stature 
and femur, stature and humerus, and femur and humerus, if we selected French men 
to have the same size and variability of stature as French women. The explanation 
of this lies in the nearly equal absolute variability of the two sexes with regard to 
stature, for, as we have seen, it is the selection of variability which modifies correlation. 
Looking at the table of values on p. 36, we see that the largest difference of 
variability in the two sexes lies in the femur, and accordingly it is from a selection of 
femur that we should expect the greatest differences in the variability and correlation 
of the two sexes to have arisen, but even this difference alone would not account for 
the observed sexual differences in the correlation. Indeed, it would be surprising if it 
did, for the selection of other organs, notably the pelvis, must have played a
considerable part in the differentiation of sex.*
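The figures of this third illustration can be retraced numerically. The sketch assumes the single-organ selection formulae in their usual form for normal variation (regression of each organ on stature for the means, and the factor $1 - (1-\mu^2) r^2$ for the variances, with $\mu = s_1/\sigma_1$); it is taken here as equivalent to the formulae (l.), (xlix.), (li.) and (lvi.) cited in the text, which is an assumption. All variable names are hypothetical.

```python
import math

# French males: 1 stature, 2 femur, 3 humerus.
m2, m3 = 45.23, 33.01
s_stat, s_fem, s_hum = 5.50, 2.37, 1.54
r12, r13, r23 = 0.811, 0.809, 0.842

h1 = 154.02 - 166.26   # shift of mean stature to the female value
mu = 5.45 / s_stat     # ratio of selected to unselected stature SD
k = 1.0 - mu ** 2

# Means of the non-selected organs move along their regression on stature.
M2 = m2 + r12 * s_fem / s_stat * h1
M3 = m3 + r13 * s_hum / s_stat * h1

# Standard deviations after selection.
root2 = math.sqrt(1.0 - k * r12 ** 2)
root3 = math.sqrt(1.0 - k * r13 ** 2)
Sigma2, Sigma3 = s_fem * root2, s_hum * root3

# Correlations after selection.
c12 = mu * r12 / root2
c13 = mu * r13 / root3
c23 = (r23 - k * r12 * r13) / (root2 * root3)

print(round(M2, 2), round(M3, 2), round(Sigma2, 2), round(Sigma3, 2))
print(round(c12, 3), round(c13, 3), round(c23, 3))
```

The means and the three correlations come out in agreement with the values printed above; the humerus standard deviation comes out at the female value of 1.53, while that of the femur stays well above the female 2.26, which is the contrast the text draws.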

(9.) I shall now proceed to a series of problems, which will show the application of 
results, such as those obtained in this memoir, to questions which arise in dealing 
with inheritance and selection. If we suppose a general population to have statistical 
" constants," which remain constant at any rate for a moderate interval, we still want 
to know not only the error which may arise from a random sampling, but also the 
sort of effect which results from our sample being too much drawn from one kind of 
environment, from a rather limited class, or from any other practically necessary or 
unconsciously introduced limitation of the random character of our sample. 

* "Primitive man and woman are more nearly equal in size, variability, and correlation than highly
civilized man and woman" ('Roy. Soc. Proc.,' vol. 61, p. 354).

Illustration I. — To find the Influence on the Intensity of Parental Heredity of the Selection of Parents.

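Let the subscripts 1, 2, 3 refer respectively to father, mother, child. Let us first
select one parent — say, the father — very stringently, i.e., $s_1/\sigma_1 = \mu_1$ is small. Then we
need only equations (xlix.) and (li.). These give us:

$$\mathfrak{r}_{13} = \frac{\mu_1 r_{13}}{\sqrt{1 - (1 - \mu_1^2)\,r_{13}^2}} \qquad \text{and} \qquad \Sigma_3^2 = \sigma_3^2\left(1 - (1 - \mu_1^2)\,r_{13}^2\right).$$

The first may be written

$$\mathfrak{r}_{13} = \frac{r_{13}}{\sqrt{r_{13}^2 + \dfrac{1 - r_{13}^2}{\mu_1^2}}},$$

or, we see that $\mathfrak{r}_{13}$ will decrease, as $\mu_1$ decreases. Thus if $r_{13} = .4$ we have for

  $\mu_1 = 1/2$,   $\mathfrak{r}_{13} = .2132$,   $\Sigma_3/\sigma_3 = .8367$,
  $\mu_1 = 1/4$,   $\mathfrak{r}_{13} = .1085$,   $\Sigma_3/\sigma_3 = .7906$,
  $\mu_1 = 1/8$,   $\mathfrak{r}_{13} = .0545$,   $\Sigma_3/\sigma_3 = .7786$.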

It is clear, therefore, that the correlation of parent and child will be much reduced by
such a selection. On the other hand, the regression coefficient will not be altered, i.e.,
$\Sigma_3\mathfrak{r}_{13}/s_1 = \sigma_3 r_{13}/\sigma_1$, as we have seen. Hence in problems of heredity, where we
suspect a parent to have been highly selected, we should seek for the regression of son
on parent rather than for the correlation. Thus in the case of Basset Hounds,* some
if not all the reduction in correlation between sire and offspring may be due to
selection of the sire. A test of whether the reduction in correlation is due to
selection of a parent ought to be given by a comparison of $s_1$ and $\Sigma_2$. We cannot,
I think, suppose, unless natural selection be very stringent, that $\sigma_2$ differs much from
$\sigma_1$. Hence it follows that $\Sigma_2/s_1$ ought generally to be large, if there be selection of
a parent. We can hardly test this point effectively in the case of the Basset Hounds,
owing to the nature of the classification. In racehorses, although the sire appears
to be far more selected than the dam, there is not a great reduction of the coefficient
of correlation between sire and offspring.† $s_1$ appears to be less than $\Sigma_2$, but not so
greatly and certainly less, that we can be surprised that the correlation of sire and
offspring is not much less than we have found it for material in which selection of the
father is certainly far less marked. We must accept the warning as to the reduction
in correlation produced by the stringent selection of one parent, but we must
remember the complexity of the factors — the variety of other influences at work in
selecting and modifying selection — before we lay much stress on this source of
alteration in parental correlation.

* 'Roy. Soc. Proc.,' vol. 66, p. 157.
† See 'Phil. Trans.,' A, vol. 195, p. 93.
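The contrast between correlation and regression under selection of one parent can be checked with a few lines. This is a sketch only, with a hypothetical function name and with both unselected standard deviations set to unity for convenience. It reproduces the three correlations quoted for $r_{13} = .4$. The ratios $\Sigma_3/\sigma_3$ it returns from the variance formula as stated (about .938, .922, .918) do not match the three printed ratios, which are what one obtains if $r_{13}$ is put in place of $r_{13}^2$ under the root; the reason for that difference is not settled here.

```python
import math

def father_selected(r13, mu):
    """Parent-child correlation and SD ratio of offspring when the father has s1/sigma1 = mu."""
    ratio = math.sqrt(1.0 - (1.0 - mu * mu) * r13 * r13)  # Sigma3 / sigma3
    corr = mu * r13 / ratio
    return corr, ratio

rows = []
for mu in (0.5, 0.25, 0.125):
    corr, ratio = father_selected(0.4, mu)
    # regression of child on father after selection, with sigma1 = sigma3 = 1
    regression = corr * ratio / mu
    rows.append((mu, corr, ratio, regression))
    print(mu, round(corr, 4), round(ratio, 4), round(regression, 4))
```

The last column stays at .4 whatever the stringency, which is the reason given in the text for preferring the regression of son on parent when a parent is suspected of being highly selected.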

Now let us deal with the case of both parents selected, and suppose their selection
given by $s_1/\sigma_1 = \mu_1$, $s_2/\sigma_2 = \mu_2$, and the change of their coefficient of assortative
mating from $r_{12}$ to $\rho_{12}$. We have from formulae (lxviii.) and (lxx.) by a little
rearranging

$$\Sigma_3^2 = \sigma_3^2\left\{1 - \beta_{13}^2(1 - \mu_1^2) - \beta_{23}^2(1 - \mu_2^2) - 2(r_{12} - \rho_{12}\mu_1\mu_2)\beta_{13}\beta_{23}\right\},$$

$$\Sigma_3\mathfrak{r}_{13} = \sigma_3\left\{r_{13} - (1 - \mu_1)\beta_{13} - (r_{12} - \rho_{12}\mu_2)\beta_{23}\right\} \qquad \text{(lxxii.)},$$

where $\beta_{13} = (r_{13} - r_{23}r_{12})/(1 - r_{12}^2)$, $\beta_{23} = (r_{23} - r_{13}r_{12})/(1 - r_{12}^2)$. Now let us take
special cases to bring out points. Let us suppose $r_{12} = 0$, or no assortative mating to
exist, and let us enquire what change would be made in parental correlation if we
selected parents who had assortatively mated, without altering their variability, i.e.,
let us take $\mu_1 = 1$, $\mu_2 = 1$, we have at once



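$$\Sigma_3^2 = \sigma_3^2\,(1 + 2\rho_{12}r_{13}r_{23}) \qquad \text{(lxxiii.)},$$

$$\Sigma_3\mathfrak{r}_{13} = \sigma_3\,(r_{13} + \rho_{12}r_{23}) \qquad \text{(lxxiv.)},$$

whence we find for $r_{13} = r_{23} = .4$ and

  $\rho_{12} = .1$,   $\Sigma_3/\sigma_3 = 1.0149$,   $\mathfrak{r}_{13} = .4335$,
  $\rho_{12} = .2$,   $\Sigma_3/\sigma_3 = 1.0315$,   $\mathfrak{r}_{13} = .4653$,
  $\rho_{12} = .3$,   $\Sigma_3/\sigma_3 = 1.0469$,   $\mathfrak{r}_{13} = .4967$,
  $\rho_{12} = .4$,   $\Sigma_3/\sigma_3 = 1.0621$,   $\mathfrak{r}_{13} = .5273$,
  $\rho_{12} = .5$,   $\Sigma_3/\sigma_3 = 1.0770$,   $\mathfrak{r}_{13} = .5571$,
  $\rho_{12} = 1$,    $\Sigma_3/\sigma_3 = 1.1489$,   $\mathfrak{r}_{13} = .6963$.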



Thus the general effect of assortative pairing of parents is to increase the correlation 
between parent and offspring sensibly, but not to very rapidly increase the variability 
of the offspring. Thus marriages within a class would, if heredity statistics were 
collected for a class, tend to show increased parental correlation. Very high assorta- 
tive mating no doubt occurs with some forms of breeding, and we may well find in 
such cases higher values of the parental heredity than we should obtain for a popula- 
tion of the same species with random mating. I think this may be an effective factor 
in the raising of the parental correlation in the case of coat-colour in thoroughbred 
horses. 
Now let us see what happens if we select both parents moderately. As test cases,
let us take $\mu_1 = \mu_2 = .8$ and $.5$, and for the extreme 0.
We have at once

$$\Sigma_3 = \sigma_3\sqrt{.68 + .32\,\mu_1^2\,(1 + \rho_{12})} \qquad \text{(lxxv.)},$$

$$\mathfrak{r}_{13} = \frac{\sigma_3}{\Sigma_3}\left\{.4\,\mu_1\,(1 + \rho_{12})\right\} \qquad \text{(lxxvi.)}.$$

Hence we deduce

              $\mu_1 = \mu_2 = .8$            $\mu_1 = \mu_2 = .5$            $\mu_1 = \mu_2 = 0$
  $\rho_{12}$   $\Sigma_3/\sigma_3$  $\mathfrak{r}_{13}$     $\Sigma_3/\sigma_3$  $\mathfrak{r}_{13}$     $\Sigma_3/\sigma_3$  $\mathfrak{r}_{13}$
    .1          .9515     .3700           .8764     .2865           .8246      0
    .2          .9622     .3991           .8809     .3093           .8246      0
    .3          .9727     .4277           .8854     .3316           .8246      0
    .4           ...      .4556           .8899     .3535           .8246      0
    .5          .9936     .4831           .8944     .3750           .8246      0
    1            ...      .6131           .9165     .4762           .8246      0

This table is very instructive. It shows us that selection and assortative mating
are factors of opposite influence; that if selection be only moderate, then with
considerable assortative mating the coefficient of parental correlation may be
increased, but if selection be stringent, then assortative mating cannot counteract,
even if as high as .5, its destructive influence on parental correlation.
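Both parental tables follow from one pair of expressions, which may be sketched as below for the case treated in the text ($r_{12} = 0$, $r_{13} = r_{23} = r$, $\mu_1 = \mu_2 = \mu$). The function name is hypothetical. With $\mu = 1$ the sketch returns the assortative-mating table, and with $\mu = .8$ it returns the printed values for moderate selection. For $\mu = .5$ the ratios $\Sigma_3/\sigma_3$ agree, but the correlations computed from (lxxvi.) come out lower than the printed column (for instance about .251 against .2865 at $\rho_{12} = .1$); which of the two is at fault is not settled here.

```python
import math

def parents_selected(mu, rho, r=0.4):
    """SD ratio of offspring and parent-child correlation when both parents are selected.

    mu  : s/sigma for each parent (taken equal for the two)
    rho : correlation between the selected parents (assortative mating)
    r   : parental correlation in the general population, with no assortative mating there
    """
    ratio = math.sqrt(1.0 - 2.0 * r * r + 2.0 * r * r * mu * mu * (1.0 + rho))
    corr = r * mu * (1.0 + rho) / ratio
    return ratio, corr

table = {}
for mu in (1.0, 0.8, 0.5, 0.0):
    for rho in (0.1, 0.2, 0.3, 0.4, 0.5, 1.0):
        table[mu, rho] = parents_selected(mu, rho)
        ratio, corr = table[mu, rho]
        print(mu, rho, round(ratio, 4), round(corr, 4))
```

Reading down a column shows assortative mating raising the correlation; reading across shows selection lowering it, and at $\mu = .5$ even $\rho_{12} = .5$ leaves the correlation below the unselected .4.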

For example, if we take parents remarkable for some intellectual or physical 
character, say with a variability only a very small fraction of that of the general 
population, then, however proportionately we might pair them, we should find their 
relationship to their children, as measured by the coefficient of correlation, very 
sensibly reduced below that of the general population. I think we have here the 
reason why Mr. Galton's Family Data, which were drawn from a rather narrow 
class, and had only a small coefficient of assortative mating, give so much smaller 
parental correlation than my own Family Data, which seem to me drawn from a 
wider class, and have a considerably higher assortative mating.*

It will be clear that with factors like assortative mating, natural selection, artificial 
selection of breeders, unconscious selection of material from one class or one environ- 
ment, modifying our coefficients of heredity in one or another direction, we can hardly 
hope for more in practical statistics than an approximation to the strength of the 
pure inheritance factor by dealing with the average of as many races and characters 
as possible.

* The work for Mr. Galton's Family Data is given, 'Phil. Trans.,' A, vol. 187, p. 270. My own
results are as yet unpublished. The average value is about .45, as compared with Mr. Galton's .34.

Illustration II. — To Find the Influence of Parental Selection on Modifying Fraternal Correlation.

Let the subscripts 1, 2 represent the parents, and 3 and 4 two of their offspring.
Let us first select one parent only, the selection being given as before by $s_1/\sigma_1 = \mu_1$.
Our formulae will now be (liv.) and (lvi.). So far as the change in variability is
concerned, we have already discussed it under our first illustration, so we need only
consider:

$$\mathfrak{r}_{34} = \frac{r_{34} - (1 - \mu_1^2)\,r_{13}r_{14}}{\sqrt{\{1 - (1 - \mu_1^2)\,r_{13}^2\}\{1 - (1 - \mu_1^2)\,r_{14}^2\}}} \qquad \text{(lxxvii.)}.$$

Now $r_{13} = r_{14}$, if the offspring are of one sex; hence:

$$\mathfrak{r}_{34} = \frac{r_{34} - r_{13}^2\,(1 - \mu_1^2)}{1 - r_{13}^2\,(1 - \mu_1^2)} \qquad \text{(lxxviii.)}.$$

If we take $r_{13} = .4$ and $r_{34} = .5$ as reasonable values, we have

$$\mathfrak{r}_{34} = \frac{.34 + .16\,\mu_1^2}{.84 + .16\,\mu_1^2} \qquad \text{(lxxix.)}.$$

Thus $\mathfrak{r}_{34}$ will be greatest when $\mu_1$ is greatest, i.e., when there is no selection, and
will decrease with $\mu_1$ until it reaches .4048, when $\mu_1 = 0$, or there is selection of
fathers of one value of the character only.*

The selection of one parent only does not, therefore, immensely modify the
correlation of brothers. Still, if we work sensibly with one class of the community —
say, men of genius — we should expect to find their sons rather less like each other
than if we worked with the general population of brothers.
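A short check of the fraternal correlation when one parent only is selected, written as a sketch with a hypothetical function name and the text's values $r_{13} = .4$, $r_{34} = .5$ as defaults:

```python
def fraternal_one_parent(mu, r13=0.4, r34=0.5):
    """Correlation between two offspring when the father is selected with s1/sigma1 = mu."""
    k = r13 * r13 * (1.0 - mu * mu)
    return (r34 - k) / (1.0 - k)

values = [fraternal_one_parent(mu) for mu in (1.0, 0.8, 0.6, 0.4, 0.2, 0.0)]
print([round(v, 4) for v in values])
```

The sequence falls steadily from .5 with no selection to the limiting .4048 for absolute selection of the father, so the whole range of the effect is under a tenth.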

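Now let us select both parents. Here again the variability of the offspring has
already been dealt with. We are concerned with equation (lxix.), and we shall put
$r_{13} = r_{14} = r_{23} = r_{24} = r$, or make parental influence equipotent for the two sexes. Then

$$\beta_{13} = \beta_{14} = \beta_{23} = \beta_{24} = \beta = \frac{r}{1 + r_{12}},$$

where $r$ is the parental correlation, and $r_{12}$ the coefficient of assortative mating.
Hence we find

$$\mathfrak{r}_{34} = \frac{r_{34} - 2\beta r + \beta^2(\mu_1^2 + \mu_2^2 + 2\rho_{12}\mu_1\mu_2)}{1 - 2\beta r + \beta^2(\mu_1^2 + \mu_2^2 + 2\rho_{12}\mu_1\mu_2)} \qquad \text{(lxxx.)}.$$

To reduce to numbers, suppose $\mu_1 = \mu_2$, and $r_{12} = 0$ for the general population. We
have

$$\mathfrak{r}_{34} = \frac{r_{34} - 2r^2 + 2r^2\mu_1^2\,(1 + \rho_{12})}{1 - 2r^2 + 2r^2\mu_1^2\,(1 + \rho_{12})} \qquad \text{(lxxxi.)}.$$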

* In general the value of $\mathfrak{r}_{34}$ ranges from $r_{34}$ down to $\dfrac{r_{34} - r_{13}^2}{1 - r_{13}^2}$.

Hence if we put $r_{34} = .5$, $r = .4$,

$$\mathfrak{r}_{34} = \frac{.18 + .32\,\mu_1^2\,(1 + \rho_{12})}{.68 + .32\,\mu_1^2\,(1 + \rho_{12})} \qquad \text{(lxxxii.)}.$$



The following table will suffice to indicate the changes which take place, when we
give a series of values to $\mu_1$ and $\rho_{12}$. Thus the first row gives the influence of selecting
parents without any assortative mating. We see that with increasing stringency of
selection the reduction of correlation is very considerable, and that with such selection
the influence of assortative mating becomes less and less. Nevertheless, assortative
mating can produce quite sensible results, if there be little or no selection. I am,
indeed, inclined to think that a good deal of the high values found for the fraternal
colour correlation in the thoroughbred foals* is due to much assortative colour mating
in sire and dam. Of course it cannot be all due to this source.

Values of Fraternal Correlation with Parental Selection.



On the whole, I think, we may conclude, so far as the relative influences of sexual 
selection in the form of assortative mating and natural selection go, that : 

Both sexual and natural selection can sensibly modify the intensity of inheritance 
as measured by the coefficient of correlation, the former tends to raise, the latter to 
lower, its intensity. But the effect of the latter, if at all stringent, is to completely 
mask the effect of the former. 

In fact, we may write

$$\mathfrak{r}_{34} = 1 - \frac{1 - r_{34}}{1 - 2r^2 + 2r^2\mu_1^2\,(1 + \rho_{12})}.$$

Hence the smaller $\mu_1^2\,(1 + \rho_{12})$, the smaller will be fraternal correlation. This
varies as the square of $\mu_1$ and only as the linear power of $1 + \rho_{12}$. Thus we see at once
why stringency of selection is far more potent than assortative mating.
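The dependence of fraternal correlation on stringency and on assortative mating can be tabulated from (lxxxii.). The sketch below uses a hypothetical function name and a grid of $\mu_1$ and $\rho_{12}$ values chosen for this example, so its output is computed from the formula and is not a transcription of the memoir's own table. It also confirms that the two ways of writing the result agree.

```python
def fraternal_both_parents(mu, rho, r=0.4, r34=0.5):
    """Fraternal correlation when both parents are selected (mu1 = mu2 = mu, r12 = 0)."""
    t = 2.0 * r * r * mu * mu * (1.0 + rho)
    return (r34 - 2.0 * r * r + t) / (1.0 - 2.0 * r * r + t)

def fraternal_alternative(mu, rho, r=0.4, r34=0.5):
    """The same quantity written as one minus a fraction."""
    return 1.0 - (1.0 - r34) / (1.0 - 2.0 * r * r + 2.0 * r * r * mu * mu * (1.0 + rho))

grid = {}
for rho in (0.0, 0.25, 0.5, 1.0):          # assumed grid of assortative mating
    for mu in (1.0, 0.8, 0.5, 0.0):        # assumed grid of stringency
        grid[rho, mu] = fraternal_both_parents(mu, rho)
    print(rho, [round(grid[rho, mu], 4) for mu in (1.0, 0.8, 0.5, 0.0)])
```

Halving $\mu_1$ divides the controlling term $\mu_1^2(1 + \rho_{12})$ by four, whereas raising $\rho_{12}$ from 0 to 1 can at most double it, which is the point of the closing remark above.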

* 'Phil. Trans.,' A, vol. 195, p. 93.

Illustration III. — To find the influence of selecting two organs A and B in
a parent, on the correlation of the like organs A′ and B′ in the offspring.

Let the organs in the parent be denoted by 1 and 2, and in the offspring by 3 and 4.
Suppose the organic correlation of the two organs in the general population to be $r'$,
so that $r_{12} = r_{34} = r'$ before any selection takes place. Let $r$, the correlation of the
organs in the parent and offspring, be supposed to be the same for both organs; then
$r = r_{13} = r_{24}$. Finally we have the coefficients of cross-heredity, $r_{14}$ and $r_{23}$. These
must vanish if there be no heredity and no organic correlation, and should be perfect
if both these are perfect. Hence we will take $r_{14} = r_{23} = r r'$ as a probable hypothesis.*
With these values of the correlation coefficients we easily find



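$$\beta_{13} = r = \beta_{24}, \qquad \beta_{23} = \beta_{14} = 0.$$

Hence from (lxviii.) and (lxix.) we have:

$$\Sigma_3^2 = \sigma_3^2\{1 - r^2 + r^2\mu_1^2\}, \qquad \Sigma_4^2 = \sigma_4^2\{1 - r^2 + r^2\mu_2^2\} \qquad \text{(lxxxiii.)},$$

$$\mathfrak{r}_{34} = \frac{r'(1 - r^2) + \rho_{12}\,r^2\mu_1\mu_2}{\sqrt{\{1 - r^2 + r^2\mu_1^2\}\{1 - r^2 + r^2\mu_2^2\}}} \qquad \text{(lxxxiv.)}.$$

For simplicity, suppose the stringency of the selection to be the same for both
organs, then:

$$\mathfrak{r}_{34} = \frac{r'(1 - r^2) + \rho_{12}\,r^2\mu_1^2}{1 - r^2 + r^2\mu_1^2} \qquad \text{(lxxxv.)}.$$

If $r = .4$, and $\rho_{12} = \gamma r'$,

$$\mathfrak{r}_{34} = r' \times \frac{.84 + .16\,\gamma\mu_1^2}{.84 + .16\,\mu_1^2} \qquad \text{(lxxxvi.)}.$$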

The following table indicates the value of $\mathfrak{r}_{34}/r'$:—

                  $\mu_1 = 1$.   $\mu_1 = .8$.   $\mu_1 = .6$.   $\mu_1 = .4$.   $\mu_1 = .2$.   $\mu_1 = 0$.
  $\gamma = 1$        1            1             1             1             1             1
  $\gamma = 0.8$     .9680        .9783         .9872         .9941         .9985          1
  $\gamma = 0.6$     .9360        .9565         .9743         .9882         .9970          1
  $\gamma = 0.4$     .9040        .9348         .9615         .9823         .9955          1
  $\gamma = 0.2$     .8720        .9131         .9487         .9763         .9940          1
  $\gamma = 0.0$     .8400        .8913         .9358         .9704         .9924          1

It will be clear from this table that if the selection be at all stringent, no reduction 
of organic correlation in the parents will affect substantially the organic correlation in 
the offspring. 

On the other hand, if y be > 1, we can have considerable modifications in the 
value of the correlation, even if the selection be stringent* 



* See 'Roy. Soc. Proc.,' vol. 85, p. 411. I have a good deal of data on the value of these cross-heredity
correlations now reduced and soon to be published.

Thus we have the following values of $R_{34}/r'$, if:

               μ_1 = 1    μ_1 = .8   μ_1 = .6   μ_1 = .4   μ_1 = .2   μ_1 = 0
γ = 2          1.1600     1.1087     1.0642     1.0296     1.0076     1
γ = 10         2.4400     1.9779     1.5775     1.2662     1.0681     1
γ = 50         8.8400     6.3243     4.1448     2.4492     1.3705     1



Lastly, if $r' = 0$:

$$R_{34} = \frac{r^2\mu_1^2\rho_{12}}{1 - r^2 + r^2\mu_1^2},$$

or, $R_{34}/\rho_{12}$ is given by:

                    μ_1 = 1    μ_1 = .8   μ_1 = .6   μ_1 = .4   μ_1 = .2   μ_1 = 0
R_34/ρ_12           .1600      .1087      .0642      .0296      .0076      0




Thus, even if there were no correlation between the organs A and B in the general
population, still a selection of parents in which such organs were correlated would
lead to offspring with correlated organs A' and B'. The amount of such correlation
would only be .1600, if the variability of the parent were not selected, and would
diminish rapidly with stringent selection of variability. Still .1600 is quite sensible,
and would, if the selection continued for a few generations, continue to increase.
Thus we see how selection of a pair of organs in a parent may increase or even create
correlation between the like organs in the offspring.

The reader will find other interesting illustrations in tracing the influence of an
absolute selection of one parent only on the correlation of the offspring, e.g., relation
between pairs of foals which all have a common sire, the influence of selecting an organ
A in the sire and an organ B in the dam on the correlation of the organs A and B in
the offspring, the influence on assortative mating of selecting parents of men of
genius,* and in many other problems.

* This is a peculiarly interesting case, for if we select men of remarkable intellectual ability, we should
expect to find both parents above the average of the general population, but with a negative correlation
between them amounting at a maximum to -.1905.

(10.) It is not without value to consider how $\rho_{12}$ arises in the case of natural or
artificial selection. Suppose we have two organs, A and B, then we shall endeavour
(i.) to give these definite values, say $\bar{x}$ and $\bar{y}$, but we shall not be able to get all our
individuals with such absolute values; we shall select with certain deviations from
$\bar{x}$ and $\bar{y}$, given by $x = \bar{x} + x'$ and $y = \bar{y} + y'$, say. Further, we shall endeavour to
make $y$ some function of $x$, say $y = f(x)$ or

$$\bar{y} + y' = f(\bar{x}) + x' f'(\bar{x}) + \tfrac12 x'^2 f''(\bar{x}) + \ldots$$

by Taylor's theorem. But $\bar{y} = f(\bar{x})$ and $x'$ is small, so that our attempted relation
will be of the form:

$$y' = mx'.$$

Here $m$ is the slope of a line to which we endeavour to confine the selected organs.
But we shall not be quite able to exactly hit this relation off; when $x' = \epsilon$, we shall
find that $y' = m\epsilon + \eta$, where $\eta$ is a small unavoidable error in selection of $y$
uncorrelated with $\epsilon$. Thus, if $s_1$ and $s_2$ be the selected variabilities, we shall have:

$$s_2^2 = \frac{1}{n} S(y'^2) = \frac{1}{n} S(m\epsilon + \eta)^2, \qquad s_1^2 = \frac{1}{n} S(\epsilon^2).$$

Therefore:

$$s_2^2 = m^2 s_1^2 + \frac{1}{n} S(\eta^2);$$

further:

$$\rho_{12} = \frac{S(x'y')}{n s_1 s_2} = \frac{m S(\epsilon^2)}{n s_1 s_2} = \frac{m s_1}{s_2},$$

and therefore:

$$\frac{1}{n} S(\eta^2) = s_2^2 (1 - \rho_{12}^2).$$

Or, $\rho_{12}$ is at once obtained from the slope of the line $m$, by which we endeavour to
fix the relationship of the organs A and B. Or, again, we may look upon
$s_2\sqrt{1 - \rho_{12}^2}$ as a quantity measuring the mean divergence of the B organ from that
absolute fulfilment of the relationship between A and B which we are striving to
attain. Thus $\rho_{12}$ is a quantity which naturally arises in every attempt, whether
artificial or natural, to select organs having a definite relationship to each other.
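
The relations of this section can be verified on a small exact example. The sketch below is only illustrative: the deviations `eps` and the errors `eta` are made-up numbers chosen so that the errors are exactly uncorrelated with the deviations, and the function name is hypothetical.

```python
from math import sqrt


def selection_line_constants(eps, eta, m):
    """Return (s1, s2, rho, mean squared error) for y' = m * x' + eta."""
    n = len(eps)
    y = [m * e + h for e, h in zip(eps, eta)]
    s1 = sqrt(sum(e * e for e in eps) / n)          # selected variability of A
    s2 = sqrt(sum(v * v for v in y) / n)            # selected variability of B
    rho = sum(e * v for e, v in zip(eps, y)) / (n * s1 * s2)
    mean_sq_error = sum(h * h for h in eta) / n     # (1/n) S(eta^2)
    return s1, s2, rho, mean_sq_error


# Made-up deviations; S(eps * eta) = 0, so eta is uncorrelated with eps.
eps = [-2.0, -1.0, 1.0, 2.0]
eta = [0.5, -1.0, 1.0, -0.5]
m = 0.6

s1, s2, rho, mean_sq_error = selection_line_constants(eps, eta, m)
print("rho            =", rho)
print("m * s1 / s2    =", m * s1 / s2)
print("(1/n) S(eta^2) =", mean_sq_error)
print("s2^2 (1-rho^2) =", s2 * s2 * (1.0 - rho * rho))
```

The two pairs of printed quantities agree, which is the content of the two displayed identities for the slope and for the mean divergence.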

Much the same considerations arise when we select three or more organs. In each 
case the selected coefficients of correlation are constants which enable us to express 
(i.) to a first approximation the form of relationship we are aiming at, and (ii.) the 
average degree of divergence from absolute fulfilment of this relationship. 

Thus, without regard to any particular distribution of frequency, the $s$'s and the
$\rho$'s are the appropriate constants to express approximately the nature of any form of
natural or artificial selection.

(11.) On the Probability of Survival and the Surface of Survival Rates. 

In the course of the present paper I have assumed that when measurements are 
made on any population for a complex of n organs, the frequency surface may be 
taken as approximately normal. If this holds for the population before and after 
selection, and measurements made on many groups at different periods of life seem 
to indicate that it is approximately true, it follows that we can determine the form 
of the probability of survival as a function of the means, variations, and correlations 
of the selected and unselected populations.

Let the unselected population be given by

$$Z = Z_0 \exp -\tfrac12\{c_{11}x_1^2 + c_{22}x_2^2 + \ldots + c_{nn}x_n^2 + 2c_{12}x_1x_2 + \ldots + 2c_{n-1,n}x_{n-1}x_n\} \tag{lxxxvi.}$$

Let the probability of survival be given by

$$p = p_0 f(x_1 - k_1,\ x_2 - k_2,\ x_3 - k_3,\ \ldots\ x_n - k_n) \tag{lxxxvii.}$$

where $f$ is at present an unknown function, which is to be a maximum for

$$x_1 = k_1,\ x_2 = k_2,\ \ldots\ x_n = k_n,$$

and, if the selection be at all stringent, to take rapidly decreasing values as

$$x_1 - k_1,\ x_2 - k_2,\ x_3 - k_3,\ \ldots\ x_n - k_n$$

take increasing large negative or positive values. It will be clear then that the
individuals who are "fittest to survive," i.e., have the smallest death-rate, are those
whose organs are defined by:

$$x_1 = k_1,\ x_2 = k_2,\ \ldots\ x_n = k_n,$$

and fitness generally will be measured by the closeness of the individual to these
"fittest" individuals.

In order to find the surface of survivors, immediately after the selection if growth
be taking place,* or at any later stage if growth have ceased, we have only to multiply
Z by p, or:

$$z = Z \times p \tag{lxxxviii.}$$

is what in the earlier part of this memoir I have termed the selection surface. Now
if this selection surface be itself normal, it will be of the form:

$$z = z_0 \exp -\tfrac12\{b_{11}(x_1 - H_1)^2 + b_{22}(x_2 - H_2)^2 + \ldots + b_{nn}(x_n - H_n)^2 + 2b_{12}(x_1 - H_1)(x_2 - H_2) + \ldots\}.$$

Here, as in the value of Z, all the constants $b_{11}, b_{22}, \ldots b_{nn}, b_{12} \ldots b_{n-1,n}$ are known
in terms of the variations and correlations. If there be selection of $q$ organs only
out of the $n$, then $b_{q+1,q+1} \ldots b_{nn}$, $b_{1,q+1}, b_{2,q+1}, \ldots b_{n-1,n}$ will all be zero. Since by
Equation (lxxxviii.) $p = z/Z$, it follows that the function $f$ which defines the
probability of survival must be of the normal exponential type, or

* I propose to deal in another memoir with the important problems of slow selection during rapid
growth, and of secular selection during several generations.

$$f(x_1 - k_1,\ x_2 - k_2,\ \ldots\ x_n - k_n) = \exp -\tfrac12\{a_{11}(x_1 - k_1)^2 + a_{22}(x_2 - k_2)^2 + \ldots + a_{nn}(x_n - k_n)^2 + 2a_{12}(x_1 - k_1)(x_2 - k_2) + \ldots\}.$$

Thus, to determine the probability of survival, we require to know the values of
the $a$'s and $k$'s in terms of the $b$'s, $H$'s, and $c$'s. The shortest method of finding $p_0$ is
to put $x_1 = k_1$, $x_2 = k_2$, ... $x_n = k_n$ and then note that:

$$p_0 = \left(\frac{z}{Z}\right)_{x_1 = k_1,\ x_2 = k_2,\ \ldots\ x_n = k_n}.$$

Since $p = z/Z$, and $z = p/Z^{-1}$,

we see that the relations for $p$, given $z$ and $Z$, and for $z$, given $p$ and $Z$, are cyclicly
interchangeable if at the same time we change $c_{11}$ to $-c_{11}$, $c_{22}$ to $-c_{22}$ ... $c_{12}$ to
$-c_{12}$ ... $c_{n-1,n}$ to $-c_{n-1,n}$. If $\sigma_1, \sigma_2, \ldots \sigma_n$ be the standard deviations of the
unselected population, this amounts to changing $\sigma_1, \sigma_2, \ldots \sigma_n$ to $\sqrt{-1}\,\sigma_1, \sqrt{-1}\,\sigma_2,
\ldots \sqrt{-1}\,\sigma_n$ respectively. Thus the results which give the probability of survival
in terms of unselected and selected populations can always by an easy interchange
be used to obtain the selected population from a knowledge of the unselected
population and of the probability of survival.

Let the unselected population be defined by $m_1, m_2, \ldots m_n$, $\sigma_1, \sigma_2, \ldots \sigma_n$ and $r_{12},
r_{13}, r_{23}, \ldots r_{n-1,n}$.

Let the selected population be defined by $m_1 + H_1, m_2 + H_2, \ldots m_n + H_n$, $\Sigma_1, \Sigma_2,
\ldots \Sigma_n$, and $R_{12}, R_{13}, R_{23}, \ldots R_{n-1,n}$.

Let the constants of the probability of survival function, or $a_{11}, a_{22}, \ldots a_{nn},
a_{12} \ldots a_{n-1,n}$, be expressed by $s_1, s_2, \ldots s_n$, $\rho_{12}, \rho_{13}, \ldots \rho_{n-1,n}$ as if it were a normal
correlation surface.*

Then the problem will be solved, if we know the $k$'s, $s$'s, and $\rho$'s in terms of the $\sigma$'s,
$r$'s, $H$'s, $\Sigma$'s, and $R$'s.

Equating the squares, products, and linear terms in the $x$'s in the equation
$p = z/Z$, we have at once the system:

$$a_{uu} = b_{uu} - c_{uu}, \qquad a_{uv} = b_{uv} - c_{uv} \tag{xcii.}$$

for all values of $u$ and $v$ from 1 to $n$;

$$-a_{r1}k_1 - a_{r2}k_2 - \ldots - a_{rv}k_v - \ldots - a_{rn}k_n = -b_{r1}H_1 - b_{r2}H_2 - \ldots - b_{rv}H_v - \ldots - b_{rn}H_n \tag{xciii.}$$

for all values of $r$ from 1 to $n$.

* These must not be confused with the $s_1, s_2, \ldots s_q$, $\rho_{12}, \rho_{13}, \rho_{23}, \ldots \rho_{q,q-1}$ constants of the $q$ selected
organs of the previous discussion. The new quantities may be in part imaginary.

If we now substitute from the first two equations for the $a$'s we find:

$$(c_{r1} - b_{r1})k_1 + (c_{r2} - b_{r2})k_2 + \ldots + (c_{rv} - b_{rv})k_v + \ldots + (c_{rn} - b_{rn})k_n = -b_{r1}H_1 - b_{r2}H_2 - \ldots - b_{rv}H_v - \ldots - b_{rn}H_n \tag{xciv.}$$



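Now let $\Delta$ be the determinant

$$\Delta = \begin{vmatrix}
c_{11} - b_{11}, & c_{12} - b_{12}, & c_{13} - b_{13}, & \ldots & c_{1n} - b_{1n} \\
c_{21} - b_{21}, & c_{22} - b_{22}, & c_{23} - b_{23}, & \ldots & c_{2n} - b_{2n} \\
c_{31} - b_{31}, & c_{32} - b_{32}, & c_{33} - b_{33}, & \ldots & c_{3n} - b_{3n} \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
c_{n1} - b_{n1}, & c_{n2} - b_{n2}, & c_{n3} - b_{n3}, & \ldots & c_{nn} - b_{nn}
\end{vmatrix}$$

and $\Delta_{uv}$ the minor of its $uv$th constituent.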
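Then we have at once:

$$\begin{aligned}
-\Delta \times k_v = {} & \Delta_{v1}(b_{11}H_1 + b_{12}H_2 + \ldots + b_{1n}H_n) \\
& + \Delta_{v2}(b_{21}H_1 + b_{22}H_2 + \ldots + b_{2n}H_n) \\
& + \Delta_{v3}(b_{31}H_1 + b_{32}H_2 + b_{33}H_3 + \ldots + b_{3n}H_n) \\
& + \ldots \\
& + \Delta_{vn}(b_{n1}H_1 + b_{n2}H_2 + \ldots + b_{nn}H_n).
\end{aligned}$$

Or:

$$\begin{aligned}
-\Delta \times k_v = {} & H_1(b_{11}\Delta_{v1} + b_{12}\Delta_{v2} + b_{13}\Delta_{v3} + \ldots + b_{1n}\Delta_{vn}) \\
& + H_2(b_{21}\Delta_{v1} + b_{22}\Delta_{v2} + b_{23}\Delta_{v3} + \ldots + b_{2n}\Delta_{vn}) \\
& + H_3(b_{31}\Delta_{v1} + b_{32}\Delta_{v2} + b_{33}\Delta_{v3} + \ldots + b_{3n}\Delta_{vn}) \\
& + \ldots \\
& + H_n(b_{n1}\Delta_{v1} + b_{n2}\Delta_{v2} + b_{n3}\Delta_{v3} + \ldots + b_{nn}\Delta_{vn})
\end{aligned} \tag{xcv.}$$

Thus $k_v$ is determined in terms of $H_1, H_2, \ldots H_n$, which define the maximum
frequency of survival.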
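
In practice the centre of fitness can be found by solving the linear system directly rather than by expanding minors. The sketch below assumes the relations a = b - c and a k = b H implied by the equations above; the helper names `solve_linear` and `fittest_centre` are hypothetical, and the elimination routine is an ordinary Gauss-Jordan solver swapped in for the determinant expansion of the text.

```python
def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-15:
            raise ValueError("singular system: b - c has no inverse")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                for j in range(col, n + 1):
                    aug[row][j] -= factor * aug[col][j]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def fittest_centre(b, c, H):
    """Centre k of the survival function, from (b - c) k = b H.

    b : quadratic-form constants of the selected population
    c : quadratic-form constants of the unselected population
    H : shifts of the means produced by the selection
    """
    n = len(H)
    a = [[b[i][j] - c[i][j] for j in range(n)] for i in range(n)]
    b_times_h = [sum(b[i][j] * H[j] for j in range(n)) for i in range(n)]
    return solve_linear(a, b_times_h)


# Single-organ check with the cephalic-index figures quoted later in the text.
k_single = fittest_centre([[1.0 / 2.885 ** 2]], [[1.0 / 3.841 ** 2]], [-6.848])
print("k =", k_single[0])
```

For a single organ the system reduces to k = H/(1 - λ^2), so the printed value should agree with the figure given for the cephalic index in Case (i.).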

In a similar manner by making the proper changes indicated above we have:

$$\begin{aligned}
\Delta' \times H_v = {} & k_1(a_{11}\Delta'_{v1} + a_{12}\Delta'_{v2} + a_{13}\Delta'_{v3} + \ldots + a_{1n}\Delta'_{vn}) \\
& + k_2(a_{21}\Delta'_{v1} + a_{22}\Delta'_{v2} + a_{23}\Delta'_{v3} + \ldots + a_{2n}\Delta'_{vn}) \\
& + k_3(a_{31}\Delta'_{v1} + a_{32}\Delta'_{v2} + a_{33}\Delta'_{v3} + \ldots + a_{3n}\Delta'_{vn}) \\
& + \ldots \\
& + k_n(a_{n1}\Delta'_{v1} + a_{n2}\Delta'_{v2} + a_{n3}\Delta'_{v3} + \ldots + a_{nn}\Delta'_{vn})
\end{aligned} \tag{xcvi.}$$

where $\Delta'$ is the determinant

$$\Delta' = \begin{vmatrix}
(c_{11} + a_{11}), & (c_{12} + a_{12}), & (c_{13} + a_{13}) & \ldots & (c_{1n} + a_{1n}) \\
(c_{21} + a_{21}), & (c_{22} + a_{22}), & (c_{23} + a_{23}) & \ldots & (c_{2n} + a_{2n}) \\
\ldots & \ldots & \ldots & \ldots & \ldots \\
(c_{n1} + a_{n1}), & (c_{n2} + a_{n2}), & (c_{n3} + a_{n3}) & \ldots & (c_{nn} + a_{nn})
\end{vmatrix}$$

and $\Delta'_{uv}$ the minor of its $uv$th constituent.

Now it will be clear from these results that as a general rule it is impossible for $k_v$ to
be equal to $H_v$. In other words: The individual most frequently met with in any given
selected community, i.e., the mediocre individual, is not the individual fittest to survive.

It is only in the limiting case of natural selection being so stringent that one type 
of individual alone is able to survive, that the fittest class has a numerical majority 
over any other class of the community. This seems to me an important, algebraically 
almost self-obvious truth, and yet one which is very much obscured by the use of such 
a phrase as the "survival of the fittest."

Of course, if there be continuous selection, or an environment so stable that the
probability of survival remains constant for a long period, there will be a gradual
approach, never theoretically an actual identification of the mediocre and the fittest.
But in actual nature the environment, at any rate so far as it depends on
climatological conditions, must have a long period as compared with the vital and reproductive
periods of innumerable forms of life. A hard winter, a drought, a flood, a famine,
a plague or epidemic of any kind, even if fairly stringent, will rarely, if ever, render
the most frequently surviving individual identical with the individual who is fittest
to survive.* Still less will this identity take place in the many processes of artificial
selection, which are becoming and will more and more become valuable laboratory
aids in our appreciation of the action of natural selection. The divergence between
the most frequently surviving and the fittest individual is measured by the above
formulae for the $k$'s in terms of the $H$'s.†

To complete the solution, the $a$'s must be found from the equations of type
$a_{uv} = b_{uv} - c_{uv}$, and then from the $a$'s the $s$'s and $\rho$'s follow by the well-known
determinants for multiple correlation: see our Equations (xi.) and (xii.).

Throughout the earlier part of this memoir I have used only the surface of selection,
but the above investigation will enable us whenever desired to replace it by the
probability of survival. I will illustrate this by obtaining the formulae suitable to the
simpler cases.

* We badly want a name for the selection which acts for a short time and rapidly modifies the adult
population. It is practically the type of selection considered in this paper. It is epidemic or catastrophic
in character.

t The point is of considerable importance, for more than one influential writer has spoken of the result 
of natural selection as the preservation of the type the mortality of which is least under the given 
conditions.

(12.) Case (i.). Selection of a Single Organ only.

The original population is given by

$$Z = Z_0 e^{-\frac{x^2}{2\sigma^2}},$$

and the curve of survivors by

$$z = z_0 e^{-\frac{(x - H)^2}{2\Sigma^2}}.$$

The probability of survival is

$$p = p_0 e^{-\frac{(x - k)^2}{2s^2}},$$

where we easily find, if $\Sigma/\sigma = \lambda$:

$$k = H/(1 - \lambda^2) \tag{xcvii.}$$

$$s = \Sigma/\sqrt{1 - \lambda^2} \tag{xcviii.}$$

and

$$p_0 = \frac{z_0}{Z_0}\, e^{\frac{H^2}{2\sigma^2(1 - \lambda^2)}} \tag{xcix.}$$




As an illustration consider a selection from modern French peasants, which should
reduce the mean and variability of their cephalic index to those of the Libyan race.

French peasants: $m = 79.786$, $\sigma = 3.841$.
Libyans: $m + H = 72.938$, $\Sigma = 2.885$.
Hence: $H = -6.848$, $\lambda = .7511$.
These give: $k = -15.712$, $s = 4.370$.

Thus for such a change as 7 points in the cephalic index to take place by selection*
we should have to make the "fittest to survive" of such a ridiculously low cephalic
index as 64.074, and such a high variation as 4.370.

* This is, of course, supposing the change to occur by catastrophic selection and not by a continuous
secular selection, see footnote preceding page.

We find

$$p_0 = 51.0474\, n/N,$$

and accordingly the probability of survival given by

$$p = 51.0474\,\frac{n}{N}\, e^{-\frac{(x + 15.712)^2}{2(4.370)^2}},$$

where N are the number of Frenchmen converted into n Libyans so far as cephalic
index is concerned.

I have purposely taken a somewhat extreme case of selection in order to illustrate
how widely the most frequently surviving individual can diverge from the fittest.

In this case, if the chances of survival (i.) of the fittest, (ii.) of the individuals most
frequent after selection, and (iii.) of the individuals most frequent before selection, be
$C_1$, $C_2$, and $C_3$ respectively, we have:

or, the chances of survival of an individual of the fittest type would be about eight times
as great as those of an individual of the most frequent type after selection and about
700 times as great as those of an individual of the most frequent type before
selection. If $\nu_1$, $\nu_2$, $\nu_3$ be the numbers after selection in the three classes* of the
fittest to survive, the most frequent after selection and the most frequent before
selection, we find

$$\nu_1 : \nu_2 : \nu_3 :: .00892 : 1 : .05978.$$

In other words, the most numerous type before selection is still after selection about
6.7 times as numerous as the type with the least mortality, and this latter type is
only about 1/112 as numerous as the type to be most frequently met with after
selection has taken place.
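
The single-organ figures can be recomputed from the four quoted constants. The sketch below is illustrative only: it assumes the formulae k = H/(1 - λ^2), s = Σ/√(1 - λ^2) and p_0 = (n/Nλ) exp{H^2/(2σ^2(1 - λ^2))}, and treats all three curves as normal; the function names are hypothetical.

```python
from math import exp, sqrt


def survival_constants(sigma, Sigma, H):
    """Return (lambda, k, s, multiplier of n/N in p_0) for one selected organ."""
    lam = Sigma / sigma
    if not 0.0 < lam < 1.0:
        raise ValueError("this sketch assumes the selection reduces the variability")
    k = H / (1.0 - lam ** 2)
    s = Sigma / sqrt(1.0 - lam ** 2)
    p0_factor = exp(H ** 2 / (2.0 * sigma ** 2 * (1.0 - lam ** 2))) / lam
    return lam, k, s, p0_factor


def relative_height(x, centre, sd):
    """Ordinate of a normal curve at x, relative to its maximum."""
    return exp(-0.5 * ((x - centre) / sd) ** 2)


m, sigma = 79.786, 3.841        # French peasants
Sigma = 2.885                   # Libyan variability
H = 72.938 - m                  # shift of the mean

lam, k, s, p0_factor = survival_constants(sigma, Sigma, H)
fittest_index = m + k

# Chances of survival relative to the fittest type (x = k).
c1_over_c2 = 1.0 / relative_height(H, k, s)
c1_over_c3 = 1.0 / relative_height(0.0, k, s)

# Numbers after selection relative to the new modal type (x = H).
nu1_over_nu2 = relative_height(k, H, Sigma)
nu3_over_nu2 = relative_height(0.0, H, Sigma)

print("lambda, k, s       :", lam, k, s)
print("fittest index      :", fittest_index)
print("p_0 / (n/N)        :", p0_factor)
print("C1/C2, C1/C3       :", c1_over_c2, c1_over_c3)
print("nu1 : nu2 : nu3    :", nu1_over_nu2, 1.0, nu3_over_nu2)
```

With these rounded inputs the ratio C1/C2 comes out near eight and C1/C3 in the hundreds, of the same order as the figures quoted in the text.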

Thus, although there would have been a very great evolution in cephalic index, 
due to a fairly stringent selection, the fittest to survive would always have formed 
but a small fraction of the dominant type. Even if we were to replace the selection 
here considered by a gradual evolution spread over several generations, we should 
still reach in the main the same conclusion, i.e., that natural selection never proceeds
by the survival of the fittest, or the survival of those with the least death-rate. These
will always remain a small fraction of the community; they are the goal, but often
the very distant goal, to which selection tends to shift the population.

(13.) Case (ii.). Selection of Two Organs.

In this case let the surface of survivors be:

$$z = z_0 \exp -\left\{\frac{(x_1 - H_1)^2}{2\Sigma_1^2(1 - R_{12}^2)} - \frac{2R_{12}(x_1 - H_1)(x_2 - H_2)}{2\Sigma_1\Sigma_2(1 - R_{12}^2)} + \frac{(x_2 - H_2)^2}{2\Sigma_2^2(1 - R_{12}^2)}\right\} \tag{c.}$$

the original population:

$$Z = Z_0 \exp -\left\{\frac{x_1^2}{2\sigma_1^2(1 - r_{12}^2)} - \frac{2r_{12}x_1x_2}{2\sigma_1\sigma_2(1 - r_{12}^2)} + \frac{x_2^2}{2\sigma_2^2(1 - r_{12}^2)}\right\} \tag{ci.}$$

and the curve of probability of survival:

$$p = p_0 \exp -\left\{\frac{(x_1 - k_1)^2}{2s_1^2(1 - \rho_{12}^2)} - \frac{2\rho_{12}(x_1 - k_1)(x_2 - k_2)}{2s_1s_2(1 - \rho_{12}^2)} + \frac{(x_2 - k_2)^2}{2s_2^2(1 - \rho_{12}^2)}\right\} \tag{cii.}$$

Since $p = z/Z$,

* By individuals of a type or class is meant here, as elsewhere in this section, all the group falling
within some small definite range of variation lying round a particular value of the organ (e.g., $m$, $m + H$,
or $m + k$), which defines the type or class.

we find at once

Let $\Sigma_1/\sigma_1 = \lambda_1$, $\Sigma_2/\sigma_2 = \lambda_2$ measure the stringency of the selection, and

$$\mu = \sqrt{\frac{1 - R_{12}^2}{1 - r_{12}^2}}$$

measure the change in correlation.* Then solving the above equations we find:
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(cxii), 



where

$$\beta^2 = 1 - r_{12}^2 - \lambda_1^2 - \lambda_2^2 + (1 - R_{12}^2)\lambda_1^2\lambda_2^2 + 2R_{12}r_{12}\lambda_1\lambda_2.$$


Similarly, if the original population and the curve of probability of surviving
or of survival rates be given, we have to find the selected population:

$$R_{12} = \frac{\rho_{12} + r_{12}\nu^2\kappa_1\kappa_2}{\sqrt{1 + \nu^2\kappa_1^2}\,\sqrt{1 + \nu^2\kappa_2^2}} \tag{cxiii.}$$




* If $r_{12} = \cos d$, $R_{12} = \cos D$, $\mu = \sin D / \sin d$. The quantity $D$ has been conveniently termed the
"divergence" by Mr. Sheppard. Hence $\mu$ is the ratio of the sines of the selected and unselected
divergences. The above formula for $\rho_{12}$ can be at once changed into one suitable for trigonometrical
logarithmic calculation. Let $\sin\alpha_1 = \mu\lambda_1$, $\sin\alpha_2 = \mu\lambda_2$, and $\rho_{12} = \cos\delta$; then, if $\Delta$ be the side of the
spherical triangle, of which $\alpha_1$, $\alpha_2$ are the other sides and $\delta$ the included angle:



sin *3 = A /{^I.( D --- A >^iA D -.±.- A )>. 

V t cos cl\ cos a.) J 
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(14.) Illustration. — The following results are taken from the paper by Miss Alice 
Lee and myself already cited : — 



                          French ♂.                   Aino ♂.
Femur                     m_1 = 45.228 centims.       40.770 centims. = m_1 + H_1.
                          σ_1 = 2.372     „            1.898    „     = Σ_1.
Humerus                   m_2 = 33.010    „           29.502    „     = m_2 + H_2.
                          σ_2 = 1.538     „            1.343    „     = Σ_2.
Correlation of Femur
  and Humerus             r_12 = .8421                 .8584          = R_12.


As indicated by the symbols above, let us select from the French a population 
having the same femur and humerus relations as the Aino. 
We have at once : 



Whence we find:

$$k_1 = -17.8464, \qquad k_2 = -15.8547, \qquad s_2 = 2.8736, \qquad p_0 = 208{,}425\, n/N,$$

where n are the number of Ainos which can be obtained from N Frenchmen.
We have accordingly the following form for the surface of fitness to survive:

$$p = 208{,}425\,\frac{n}{N} \exp -\{.222{,}056\,(x + 17.8464)^2 + .326{,}960\,(y + 15.8547)^2 - .486{,}450\,(x + 17.8464)(y + 15.8547)\}.$$

Now it is clear that if we wanted by a "catastrophic" selection to convert the
French into something resembling the Aino, we should have to give the least
death-rate to those French with femur corresponding to $m_1 + k_1$ and humerus to $m_2 + k_2$,
or to the dwarfs with femur = 27.382 centims. and humerus = 17.155 centims.!
By no other means could we shift the modal value of the French population down as
low as the Aino modal value. The physical meaning of this is that we have been
compelled to put on an excessive death-rate for the bigger Frenchmen.

An interesting point of our work is that

$$k_1 = 1.2514\,H_1 + 3.4971\,H_2,$$
$$k_2 = -1.0266\,H_1 + 5.8242\,H_2,$$

whence we see that while a selective reduction of humerus is far more effective in
reducing both femur and humerus centres of survival than a reduction of femur, a
selective reduction of femur occurring contemporaneously with that of the humerus
actually tends to raise the centre of the humerus, i.e., the coefficient of $H_1$ is
negative.
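
The coefficients of H_1 and H_2 can be recovered approximately from the constants of the two populations. The sketch below rests on assumptions: it uses k = (b - c)^{-1} b H for two organs, and it takes the French standard deviations as σ_1 = 2.372, σ_2 = 1.538 and the Aino humerus value as Σ_2 = 1.343, figures back-computed from the principal axes tabulated in this section rather than read directly from the data table. Function names are hypothetical.

```python
def inverse_covariance(sd1, sd2, r):
    """Return (q11, q22, q12) of the inverse covariance matrix of two organs."""
    d = 1.0 - r * r
    return 1.0 / (sd1 * sd1 * d), 1.0 / (sd2 * sd2 * d), -r / (sd1 * sd2 * d)


def centre_coefficients(unselected, selected):
    """Matrix M with k = M H, where M = (b - c)^{-1} b for two organs."""
    c11, c22, c12 = inverse_covariance(*unselected)
    b11, b22, b12 = inverse_covariance(*selected)
    a11, a22, a12 = b11 - c11, b22 - c22, b12 - c12
    det = a11 * a22 - a12 * a12
    row1 = ((a22 * b11 - a12 * b12) / det, (a22 * b12 - a12 * b22) / det)
    row2 = ((a11 * b12 - a12 * b11) / det, (a11 * b22 - a12 * b12) / det)
    return row1, row2


french = (2.372, 1.538, 0.8421)   # sigma_1, sigma_2 (assumed), r_12
aino = (1.898, 1.343, 0.8584)     # Sigma_1, Sigma_2 (assumed), R_12
H1, H2 = 40.770 - 45.228, 29.502 - 33.010

(m11, m12), (m21, m22) = centre_coefficients(french, aino)
k1 = m11 * H1 + m12 * H2
k2 = m21 * H1 + m22 * H2

print("k1 = %.4f H1 + %.4f H2 = %.3f" % (m11, m12, k1))
print("k2 = %.4f H1 + %.4f H2 = %.3f" % (m21, m22, k2))
print("fittest femur, humerus:", 45.228 + k1, 33.010 + k2)
```

Because the inputs are rounded to three decimals, the coefficients agree with those of the text only to about two or three figures; the negative coefficient of H_1 in k_2 is reproduced.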

Now let us consider the frequency of survivors per unit length, say centimetre of
femur and humerus, at different points. The surface of survivors, i.e., the Aino
population, is

$$z = z_0 \exp -\frac{1}{2(1 - R_{12}^2)}\left\{\frac{(x - H_1)^2}{\Sigma_1^2} - \frac{2R_{12}(x - H_1)(y - H_2)}{\Sigma_1\Sigma_2} + \frac{(y - H_2)^2}{\Sigma_2^2}\right\}.$$

If we put $x = 0$, $y = 0$ we have the frequency after selection of the original
population type; if we put $x = H_1$, $y = H_2$ we have the frequency after selection of
the new population type; and if we put $x = k_1$, $y = k_2$, we shall have the frequency
after selection of those best fitted to survive. If these frequencies be $\nu_3$, $\nu_2$, $\nu_1$
respectively, we find on substituting the numerical values that

$$\nu_1 : \nu_2 : \nu_3 :: .117/10^{18} : 1 : .032289.$$

Thus the most frequent type of the new population is now about thirty times as
frequent as the old most frequent type, while the type most fitted to survive has
practically no existence at all. It probably lies outside the actual boundary of the
French population.
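
These frequency ratios follow from evaluating the normal surface of survivors at the three points. The sketch below assumes the Aino constants Σ_1 = 1.898, R_12 = .8584 and a humerus value Σ_2 = 1.343 (the last back-computed from the principal axes tabulated in this section, so an assumption), together with the values of k_1 and k_2 quoted in the text; function names are hypothetical.

```python
from math import exp


def quadratic_form(dx, dy, sd1, sd2, r):
    """Exponent (without the factor -1/2) of a normal correlation surface."""
    return (dx * dx / (sd1 * sd1)
            - 2.0 * r * dx * dy / (sd1 * sd2)
            + dy * dy / (sd2 * sd2)) / (1.0 - r * r)


def relative_frequency(point, centre, sd1, sd2, r):
    """Frequency at `point` relative to the frequency at the centre."""
    dx, dy = point[0] - centre[0], point[1] - centre[1]
    return exp(-0.5 * quadratic_form(dx, dy, sd1, sd2, r))


Sigma1, Sigma2, R12 = 1.898, 1.343, 0.8584      # Sigma2 is an assumed reading
H = (40.770 - 45.228, 29.502 - 33.010)          # new population type
k = (-17.8464, -15.8547)                        # fittest to survive (from the text)

nu3_over_nu2 = relative_frequency((0.0, 0.0), H, Sigma1, Sigma2, R12)
nu1_over_nu2 = relative_frequency(k, H, Sigma1, Sigma2, R12)

print("nu3/nu2 =", nu3_over_nu2)
print("nu1/nu2 =", nu1_over_nu2)
```

The second ratio is of the order of 10^-19, which is the sense in which the fittest type "has practically no existence at all" among the survivors.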

Here really arises the question as to how we are, in any actual problem, to fix the
ratio of n to N, or, what amounts to the same thing, to fix a practical boundary to a
given population. Such a boundary must be conventional, but I think that for
practical purposes we are quite safe if we assume that an individual who occurs only
once per thousand can produce no effect on the physical evolution of the population
as a whole.

Now the form of a correlation-surface for two organs, $x$ and $y$, is

$$z = \frac{N}{2\pi\sigma_1\sigma_2\sqrt{1 - r^2}} \exp -\frac{1}{2(1 - r^2)}\left\{\frac{x^2}{\sigma_1^2} - \frac{2rxy}{\sigma_1\sigma_2} + \frac{y^2}{\sigma_2^2}\right\},$$

which gives a system of
similar ellipses which are the contour lines of the surface, or lines of equal frequency,
i.e., giving individuals with equal probability of occurrence. Let the equation to
these contour lines referred to their principal axes be

$$\frac{x^2}{A^2} + \frac{y^2}{B^2} = \kappa^2,$$

where $A^2 + B^2 = \sigma_1^2 + \sigma_2^2$ and $AB = \sigma_1\sigma_2\sqrt{1 - r^2}$.
Further, if $\phi$ be the angle the A principal axis makes with the axis of $x$, we
have:*

$$\tan 2\phi = 2r\sigma_1\sigma_2/(\sigma_1^2 - \sigma_2^2).$$

These fully determine the principal axes of the frequency surface. Now consider
the frequency between the elliptic cylinders corresponding to $\kappa$ and $\kappa + \delta\kappa$; we
have it

$$= z \times 2\pi AB\,\kappa\,\delta\kappa = z \times 2\pi\sigma_1\sigma_2\sqrt{1 - r^2}\,\kappa\,\delta\kappa = N e^{-\frac12\kappa^2}\kappa\,\delta\kappa.$$

Hence, if $N_\kappa$ be the frequency outside any contour $\kappa$,

$$N_\kappa = N e^{-\frac12\kappa^2}.$$

For $N_\kappa$ to be $\frac{1}{1000}$ of $N$ we have simply

$$\kappa^2 = \frac{6}{\log_{10} e}, \quad \text{whence } \kappa = 3.716{,}923 \tag{cxviii.}$$
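
The conventional boundary is easily evaluated. The sketch below assumes the frequency outside a contour κ is N exp(-κ^2/2), so that the contour excluding a given fraction is κ = √(-2 log_e fraction); the function names are hypothetical.

```python
from math import exp, log, sqrt


def contour_parameter(outside_fraction):
    """Contour kappa beyond which `outside_fraction` of the population lies."""
    return sqrt(-2.0 * log(outside_fraction))


def fraction_outside(kappa):
    """Fraction of a normal two-organ population lying outside contour kappa."""
    return exp(-0.5 * kappa * kappa)


kappa_limit = contour_parameter(1.0 / 1000.0)
print("kappa for 1 in 1000:", kappa_limit)

# Semi-axes of the French 1-in-1000 ellipse from its unit principal axes.
print("French limit axes  :", 2.7338 * kappa_limit, 0.7197 * kappa_limit)
```

Multiplying the principal axes of any of the populations by this κ gives the "1 in 1000 limit" entries of the table that follows.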



* For easy calculation put $\gamma = \sqrt{\sigma_1^2 + \sigma_2^2}$, $\tan\psi = \sigma_2/\sigma_1$. Then we have at once if $r = \cos D$:

$$A = \gamma\cos\chi, \qquad B = \gamma\sin\chi,$$

where: $\sin 2\chi = \sin 2\psi \sin D$, $\tan 2\phi = \tan 2\psi \cos D$.

This will enable us to determine our conventional boundary to effective population.
Now let us refer our non-selected and selected populations to their centres and
principal axes.

We find for the contour curves:

                          Unselected Population    Selected Population    Surface of Survival
                          (i.e., French).          (i.e., Aino).          (i.e., Rate of Survival).
x centre (femur)          45.228                   40.770                 27.382
y centre (humerus)        33.010                   29.502                 17.155
tan (slope to x)          .601,3775                .670,1454              .807,3371
Principal axes, A         2.7338                   2.2514                 4.4115
Principal axes, B         .7197                    .5808                  .9775
1 in 1000 limit, A        10.1615                  8.3682
1 in 1000 limit, B        2.6750                   2.1588

Referred to its principal axes, the rate of survival is now

$$p = 208{,}425\,\frac{n}{N}\, e^{-\frac12\left\{\frac{x^2}{(4.4115)^2} + \frac{y^2}{(.9775)^2}\right\}}.$$

Suppose we require to get at least 1000 Aino out of the French population, N, then
n = 1000. Now suppose the Aino limiting ellipse drawn, then the French population
must be sufficiently large to give the individuals inside this ellipse. Now $p$ gets
smaller as we go further from the centre of the survival surface. Hence the contour
line of the survival surface corresponding to $p = 1$ must be touched externally by the
limiting contour of the Aino population, in order that we may get at least 1000 Aino
out of N Frenchmen. Now, by a graphical construction, I find the major axis of the
elliptic contour line of the survival surface which touches the Aino limiting ellipse, is
about 11.44. This gives for the parameter $\kappa_1$ of this ellipse, $\kappa_1 \times 4.4115 = 11.44$, or
$\kappa_1 = 2.5932$. Whence:

$$p = 1 = 208{,}425\,\frac{1000}{N}\, e^{-\frac12 (2.5932)^2}$$

gives the greatest possible value of $p$ and the least possible value of N. Numerically
this gives us N = 7,200,000 about, or we should want more than 7,000,000 of
Frenchmen to obtain our 1000 Aino by a catastrophic selection. The actual bounding
contour line of this least possible number of Frenchmen* has for its major axis
15.285 centims., and it touches the Aino limiting ellipse at the point where it is
touched by the survival contour $p = 1$.

* The least possible to reproduce the Aino, as far as femur and humerus are concerned, to 1 in a 1000
of the population.

Now let us turn the problem round and ask what is the least population of Aino
from which we could produce 1000 Frenchmen by a catastrophic selection. In this
case the surface of survivals is simply obtained by inverting $p$, and, if N' be the
number of Aino, n' = 1000 = required number of Frenchmen, we have:

$$p' = \frac{1}{208{,}425}\,\frac{n'}{N'}\, e^{\frac12\left\{\frac{x^2}{(4.4115)^2} + \frac{y^2}{(.9775)^2}\right\}}.$$

Here $p'$ gets larger as we go away from the centre of the surface of survivals,
and we must therefore make the French limiting ellipse just touched internally by
the contour line of the surface of survivors for which $p' = 1$. The major axis of this
contour line for $p' = 1$ was found by a graphical process to be about 33.91. This
gives for $\kappa_2$:

$$\kappa_2 \times (4.4115) = 33.91, \quad \text{or } \kappa_2 = 7.6867.$$

Thus:

$$p' = 1 = \frac{1000}{208{,}425\, N'}\, e^{\frac12 (7.6867)^2}$$

leads to N' = 32,460,000,000 about, or we should want upwards of 32,460,000,000 of
Aino to produce the 1000 Frenchmen. The bounding contour line of this number of
Aino has a major axis of 15.890 centims. about, and touches the French limiting
contour in the point in which it is touched by the $p' = 1$ contour of the surface of
survivals.
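
Both population estimates come from setting the survival rate equal to unity on the touching contour. The sketch below assumes the rate of survival takes the form 208,425 (n/N) exp(-κ^2/2) on contour κ for the forward selection and its reciprocal form for the reverse selection; the two major axes 11.44 and 33.91 are the graphical readings quoted in the text, and the function names are hypothetical.

```python
from math import exp

P0_MULTIPLIER = 208425.0     # multiplier of n/N in p_0, as quoted in the text
MAJOR_AXIS_UNIT = 4.4115     # major principal axis of the survival surface


def least_population_forward(n_wanted, kappa):
    """Least N with 208,425 * (n/N) * exp(-kappa^2 / 2) = 1."""
    return P0_MULTIPLIER * n_wanted * exp(-0.5 * kappa * kappa)


def least_population_reverse(n_wanted, kappa):
    """Least N' with (1/208,425) * (n'/N') * exp(+kappa^2 / 2) = 1."""
    return n_wanted * exp(0.5 * kappa * kappa) / P0_MULTIPLIER


kappa_1 = 11.44 / MAJOR_AXIS_UNIT    # contour touching the Aino limiting ellipse
kappa_2 = 33.91 / MAJOR_AXIS_UNIT    # contour touching the French limiting ellipse

n_french = least_population_forward(1000, kappa_1)
n_aino = least_population_reverse(1000, kappa_2)

print("kappa_1, kappa_2      :", kappa_1, kappa_2)
print("Frenchmen for 1000 Aino:", n_french)
print("Aino for 1000 French   :", n_aino)
print("per head               :", n_french / 1000.0, n_aino / 1000.0)
```

The results are very sensitive to the graphically determined axes, since κ enters through exp(±κ^2/2); a small change in the reading of 33.91 alters N' considerably.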

Now the difference between these two unselected populations is very great. We 
see that to get the Aino a very great number of Frenchmen would have to be 
exterminated, about 7000 for each Aino selected ; but to get the Frenchmen from 
the Aino an appalling number of Aino would have to be destroyed, upwards of 
32,000,000 for each Frenchman selected. Even if the selection were not catastrophic 
but spread over centuries and centuries, we must recognise what a large consumption
of life there must be, individuals destroyed without progeny,* if we are to suppose
any highly civilized race like the French produced by selection from an apparently 
primitive type like the Aino. Indeed, the return journey in this case seems much 
easier than the upward ascent. Beyond all this we have only made French and 
Aino alike for two organs, and only for one character of each of them ! Allowing for 
our conventional limit to the population, allowing for the fact that our Aino data are 
drawn from a very limited population of remarkably small variability, it seems very 
improbable that the French have ever been produced by selection from a primitive 
race at all resembling the Aino. The fact that the Aino could be so much more 
easily obtained by selection from the French seems to indicate that they are rather 

some degenerate offshoot of a race superior to themselves than a sample of the
primitive people from which the Circassian races may be supposed to have sprung.

* Of course, with a secular selection spread over many generations, it is largely the potentiality and not
the actuality of life which is destroyed. Still, while the gross number killed among a small primitive
community may not be large, the death-rate must still be immense. I hope to return to these points
when dealing with secular selection as distinguished from catastrophic selection.

The whole of this discussion is, of course, very hypothetical ; no stress whatever is 
to be laid upon it except as an illustration of method, and a rough appreciation of the 
vast amount of elimination which must be necessary to evolve one race from a second 
in the case of organs which we know by measurement to have continuity of variation, 
and only saltatory changes in pathological cases, which have, as far as we can judge, 
no influence on the mass-evolution which has produced the local races of man. 

But given fair samples of material our method will enable us to determine whether 
a race A — for of course a limited number of characters — could with less destruction be 
deduced from a race B, than the race B from A. It will not therefore follow that the 
path of least selection is that which necessarily was used by Nature. Possibly both 
A and B have been reached by far less expenditure of material from 0. Still it is 
something definite in the midst of our gropings after truth in problems of descent to 
have even a rough appreciation of the amount of selective destruction which would 
arise from alternative suggestions. That is why this special numerical illustration of 
the surface of survival has been given. 

The reader will possibly find the matter rendered somewhat clearer by the diagram. 
The femur is measured along the horizontal and the humerus along the vertical.
A is the type or mean femur-humerus of the Aino population. Within the
continuous ellipse round A the whole Aino population up to 1 in 1000 would fall.
F is the type of the French population and the continuous ellipse round F gives the 
area within which up to 1 in 1000 of the French population fall. Since the diagram is 
drawn to centimetres of the bones, it will be seen how very small are the limits of 
variation within both populations. P is the centre of the surface of survivals ; for the 
selection of Aino from French it makes the " fittest to survive." In the case of the 
selection of the French from the Aino, P is no longer the centre of fitness, but the
"centre of unfitness"; the Aino are killed off with an intensity which increases the
closer we approach to P. Now it seems to me that these two cases, which are quite
distinct in theory, ought to manifest themselves in Nature and require distinguishing
names. A race may be modified because a complex of organs with a certain system
of values is good for it, or because it is bad for it. The race may be modified because 
a certain element of it is fittest or because it is unfittest to survive. In the former 
case we select for survival round the centre, in the latter case we select for destruction. 
I propose to call these cases positive and negative selection respectively. It may be said 
that if there be positive selection in one part of the population there will be negative 
in another. But the kernel of the matter is in either case the existence of a centre, 
a definite set of most fit or of most unfit organs, while in positive selection the less fit 
organs, and in negative selection the more fit organs are distributed over wide areas 
of the field, and do not reach a maximum of unfitness or a maximum of fitness 
respectively for any definite individual. 

In the diagram we have also drawn the contour line to which the French population 
must extend if we are to get at least a representative population of 1000 Aino from 
it, and further the contour line to which the Aino population must extend if we are 
to get at least a representative population of 1000 French from it. A consideration of 
the nature of the contour lines of the surface of survivals shows that the contour lines 
above referred to, and marked "boundary" in the diagram, must touch the Aino 1 in 1000 
limit and the French 1 in 1000 limit respectively at the points in which they are 
touched by the contour lines p = 1 and p' = 1 of the corresponding surfaces of 
survivals. I have already indicated that the major axes of these boundaries are 
15.285 for the French and 15.890 for the Aino. The corresponding values of the 
parameter k are respectively given by 

[Equation illegible in the source: expressions for k_F and k_A involving the major axes 15.285 and 15.890 respectively.] 

Hence by (cxviii.) we can easily find the frequency of population outside the 
contours k_F and k_A; if these be v_F and v_A we have: 

v_F = .000,000,163, v_A = .000,000,000,015. 

Thus the French population would have to be extended to a boundary in which only 
about 1 in six millions was excluded, and the Aino population to a boundary excluding 
only 15 in the billion! The boundaries of what we may thus term the selection 
populations are far larger than our conventional boundaries of 1 in 1000 for 
representative populations. In fact, it would be impossible to select a representative Aino 
population from a conventional representative French population and vice versa; in 
either case the very exceptional members of French or Aino populations are 
required to complete the conventional representative populations of Aino or French 
by selection. 
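
The relation between an excluded frequency and the size of the bounding contour can be sketched numerically. The sketch below assumes the standard result for a normal distribution of two organs, namely that the fraction of the population lying outside an equal-frequency ellipse at squared generalised distance chi2 from the mean is exp(-chi2/2). Whether this is the exact form of (cxviii.), which is not reproduced in this section, is an assumption, and the function names are illustrative only.

```python
import math


def fraction_outside(chi2: float) -> float:
    """Fraction of a bivariate normal population outside the
    equal-frequency ellipse with squared generalised distance chi2."""
    return math.exp(-chi2 / 2.0)


def chi2_for_fraction(v: float) -> float:
    """Inverse relation: chi2 of the ellipse that excludes a fraction v."""
    return -2.0 * math.log(v)


def axis_ratio(v: float, v_ref: float = 1e-3) -> float:
    """Linear magnification of the ellipse excluding v relative to the
    ellipse excluding v_ref (the axes scale as the square root of chi2)."""
    return math.sqrt(chi2_for_fraction(v) / chi2_for_fraction(v_ref))


# Excluded frequencies quoted in the text for the two selection populations.
v_french = 1.63e-7
v_aino = 1.5e-11

for label, v in [("French", v_french), ("Aino", v_aino)]:
    print(label,
          "one excluded in", round(1.0 / v),
          "| chi2 =", round(chi2_for_fraction(v), 2),
          "| axes relative to the 1 in 1000 ellipse:", round(axis_ratio(v), 3))
```

Under this assumption the conventional 1 in 1000 boundary corresponds to chi2 = -2 ln(0.001), roughly 13.8, and the quoted frequencies require ellipses whose axes are appreciably longer than those of that conventional boundary.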

(15.) I have devoted most of my consideration of the surface of survivals to 
a particular case in which two organs have been selected, and we consider the nature 
of p which determines the fraction of each group of individuals which survives. 
I have done this partly because normal surfaces are at best only an approximate 
representation of our selectable and selected distributions, and partly because I have 
thought a concrete case would best bring out the general points of investigations of 
this kind. 

But some little account of the properties of the surface of survival-rates ought 
to be given here, or it may appear that they have been overlooked. While the 
contour lines of the correlation frequency surfaces for two organs must be ellipses, this 
does not follow in the case of the surface of survival-rates. In our illustration they 
were ellipses, but they may be also parabolas, hyperbolas, or even straight lines. 
We must not therefore expect to find always a "centre" of positive or negative 
selection. We may come across a "saddle-back system" of contours with the rate 
of survival constant along two intersecting lines, but rising in one pair of opposite 
angles and falling in the other pair. In this case we have fields of negative and 
positive selection separated by two independent relations between the two organs, 
which are linear and for which the survival-rate is the same; they may be termed the 
"critical lines." For one pair of angles the centre is now a "centre of fitness," for 
the other pair of angles a "centre of unfitness." It seems to me that these critical 
organic relations may possess considerable biological importance. 
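
The critical lines of a saddle-back system can be located by a short calculation, sketched here under an assumption not stated in this section: that near the centre the survival-rate depends on the deviations x, y of the two organs only through a quadratic form q(x, y) = a x^2 + 2 h x y + b y^2. The coefficient names a, h, b and the function name are hypothetical. The critical lines are then the two directions through the centre along which q vanishes, so that the survival-rate keeps its central value.

```python
import math


def critical_directions(a: float, h: float, b: float):
    """Direction vectors (dx, dy) of the two critical lines of the
    quadratic form q = a*x^2 + 2*h*x*y + b*y^2, i.e. the lines through
    the centre along which q = 0. They exist only in the hyperbolic
    (saddle-back) case h^2 > a*b."""
    disc = h * h - a * b
    if disc <= 0:
        raise ValueError("no real critical lines: contours are not hyperbolic")
    s = math.sqrt(disc)
    if b != 0:
        # Roots of a*dx^2 + 2*h*dx*dy + b*dy^2 = 0 taken with dx = b.
        return [(b, -h + s), (b, -h - s)]
    # b == 0: the form factorises as x * (a*x + 2*h*y).
    return [(0.0, 1.0), (2.0 * h, -a)]


# Example: q = x^2 - y^2 has the critical lines y = x and y = -x.
for dx, dy in critical_directions(1.0, 0.0, -1.0):
    print("critical direction:", (dx, dy))
```

In the sectors between the two lines q is of one sign in one pair of opposite angles and of the other sign in the remaining pair, which is the alternation of rising and falling survival-rate described above.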

If the contour lines of the surface of survival-rates are parabolas, we have really 
only a limiting case of the centre at a very great distance. It is one in which the 
fittest (or most unfit) has no practical existence, but there is a direction towards 
which the rate of survival will be found to be always increasing or decreasing. 

If the contour lines of the surface of survival-rates are parallel straight lines, then 
so long as the deviation in one organ has a certain definite relation to that in the 
other, the survival-rate will remain constant. In this case the survival-rate will 
fall uniformly in one direction and remain constant in the direction at right angles 
to it. 

All the cases I have given here can occur just as easily as the elliptic contour 
system of our illustration and diagram. Each is marked by quite definite biological 
characteristics, and we may, perhaps, class them as elliptic, hyperbolic, parabolic, 
and linear selection. Even if the surface of survival-rates be not of the exponential 
quadric type discussed in this paper, yet to the neighbourhood of each part of it 
this classification of selection types will apply. 
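
The fourfold classification can be made concrete by the usual discriminant test for conics. The sketch below assumes that, in the neighbourhood considered, the exponent of the survival-rate is approximated by q(x, y) = a x^2 + 2 h x y + b y^2 + 2 g x + 2 f y in the deviations of the two organs; these coefficient names and the function name are hypothetical, and the tolerance is an arbitrary choice.

```python
def classify_selection(a, h, b, g=0.0, f=0.0, tol=1e-12):
    """Classify the contour system of
        q = a*x^2 + 2*h*x*y + b*y^2 + 2*g*x + 2*f*y
    as 'elliptic', 'hyperbolic', 'parabolic' or 'linear'."""
    if max(abs(a), abs(h), abs(b)) <= tol:
        if max(abs(g), abs(f)) <= tol:
            raise ValueError("q is constant: no selection at all")
        # Only linear terms: parallel straight contours, uniform change.
        return "linear"
    disc = a * b - h * h
    if disc > tol:
        return "elliptic"      # closed contours round a centre
    if disc < -tol:
        return "hyperbolic"    # saddle-back system with two critical lines
    # Degenerate quadratic part: q varies quadratically in one direction only.
    # n is the direction along which the quadratic part does not change.
    n = (-h, a) if abs(a) >= abs(b) else (b, -h)
    if abs(g * n[0] + f * n[1]) <= tol:
        # No change at all along n: the contours are still parallel
        # straight lines, though the rate need not change uniformly
        # across them.
        return "linear"
    return "parabolic"         # centre removed to a very great distance


print(classify_selection(1.0, 0.2, 0.5))            # elliptic
print(classify_selection(1.0, 0.0, -1.0))           # hyperbolic
print(classify_selection(1.0, 0.0, 0.0, 0.0, 1.0))  # parabolic
print(classify_selection(0.0, 0.0, 0.0, 1.0, 2.0))  # linear
```

The sign of a b - h^2 separates the elliptic and hyperbolic cases; when it vanishes, the linear terms decide between parabolic contours and parallel straight lines.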

If we pass to more than two organs, then similar considerations will apply; we 
shall only be reproducing the geometry of quadric surfaces in space of three and 
higher dimensions. But before we allow ourselves excursions into the higher 
geometry of the surface of survival-rates, it seems desirable that we should obtain 
quantitative determinations of this surface by experiments in artificial selection. 
We shall then be better able to see what part of our geometry will really be of 
service for the problems of natural selection. The field is too large to be cultivated 
for merely theoretical interests. We must first determine what parts of it are likely 
to have practical application to life as we find it, but of death-rates in the case of 
any living form but man, we are at present sadly ignorant. 






Key to Selective Correlation Tables. 
Formula: 

$$R' = R + 10\,(\Delta_{\mu_1} R)\,\delta\mu_1 + 4\,\{(\Delta_{r_{12}} R)\,\delta r_{12} + (\Delta_{r_{13}} R)\,\delta r_{13} + (\Delta_{r_{23}} R)\,\delta r_{23}\}$$

Occasionally second differences must be used. 